Search Sciweavers | Sciweavers

64 search results - page 8 / 13

» Hierarchical Explanation-Based Reinforcement Learning

PKDD
2009
Springer

Data Mining

Compositional Models for Reinforcement Learning

16 years 1 months ago

Download userweb.cs.utexas.edu

Abstract. Innovations such as optimistic exploration, function approximation, and hierarchical decomposition have helped scale reinforcement learning to more complex environments, ...

Nicholas K. Jong, Peter Stone

AAAI
1996

Intelligent Agents

Evolution-Based Discovery of Hierarchical Behaviors

15 years 8 months ago

Download www.aaai.org

Procedural representations of control policies have two advantages when facing the scale-up problem in learning tasks. First they are implicit, with potential for inductive genera...

Justinian P. Rosca, Dana H. Ballard

ICML
2003
IEEE

Machine Learning

Hierarchical Policy Gradient Algorithms

16 years 8 months ago

Download www.hpl.hp.com

Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...

Mohammad Ghavamzadeh, Sridhar Mahadevan

ICRA
2006
IEEE

Robotics

Quadruped Robot Obstacle Negotiation via Reinforcement Learning

16 years 1 months ago

Download www.stanford.edu

Legged robots can, in principle, traverse a large variety of obstacles and terrains. In this paper, we describe a successful application of reinforcement learning to the proble...

Honglak Lee, Yirong Shen, Chih-Han Yu, Gurjeet Sin...

ICML
2010
IEEE

Machine Learning

Bayesian Multi-Task Reinforcement Learning

15 years 8 months ago

Download hal.inria.fr

We consider the problem of multi-task reinforcement learning where the learner is provided with a set of tasks, for which only a small number of samples can be generated for any g...

Alessandro Lazaric, Mohammad Ghavamzadeh
