Search Sciweavers | Sciweavers

165 search results - page 24 / 33

» Exploration and apprenticeship learning in reinforcement lea...


RAS
2010


Probabilistic Policy Reuse for inter-task transfer learning

13 years 6 months ago

Download scalab.uc3m.es

Policy Reuse is a reinforcement learning technique that efficiently learns a new policy by using past similar learned policies. The Policy Reuse learner improves its exploration b...

Fernando Fernández, Javier García, M...


NIPS
2003


Approximate Planning in POMDPs with Macro-Actions

13 years 9 months ago

Download books.nips.cc

Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...

Georgios Theocharous, Leslie Pack Kaelbling


CORR
2012
Springer


Fractional Moments on Bandit Problems

12 years 3 months ago

Download www.cse.iitm.ac.in

Reinforcement learning addresses the dilemma between exploration to find profitable actions and exploitation to act according to the best observations already made. Bandit proble...

Ananda Narayanan B., Balaraman Ravindran


HYBRID
2005
Springer


Learning Multi-modal Control Programs

14 years 1 months ago

Download users.ece.gatech.edu

Abstract. Multi-modal control is a commonly used design tool for breaking up complex control tasks into sequences of simpler tasks. In this paper, we show that by viewing the contr...

Tejas R. Mehta, Magnus Egerstedt


ICML
2009
IEEE


Near-Bayesian exploration in polynomial time

14 years 8 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng
