Search Sciweavers | Sciweavers

166 search results - page 22 / 34

» Safe exploration for reinforcement learning

ICML
2009
IEEE

Near-Bayesian exploration in polynomial time

14 years 9 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng

IHI
2010

Beyond safe harbor: automatic discovery of health information de-identification policy alternatives

13 years 3 months ago

Download hiplab.mc.vanderbilt.edu

Regulations in various countries permit the reuse of health information without patient authorization provided the data is "de-identified". In the United States, for ins...

Kathleen Benitez, Grigorios Loukides, Bradley Mali...

NIPS
2003

Approximate Planning in POMDPs with Macro-Actions

13 years 10 months ago

Download books.nips.cc

Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...

Georgios Theocharous, Leslie Pack Kaelbling

CORR
2012
Springer

Fractional Moments on Bandit Problems

12 years 4 months ago

Download www.cse.iitm.ac.in

Reinforcement learning addresses the dilemma between exploration to find profitable actions and exploitation to act according to the best observations already made. Bandit proble...

Ananda Narayanan B., Balaraman Ravindran

PKDD
2010
Springer

Exploration in Relational Worlds

13 years 7 months ago

Download user.cs.tu-berlin.de

One of the key problems in model-based reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large relational domains, in wh...

Tobias Lang, Marc Toussaint, Kristian Kersting
