We present a new approach to reinforcement learning in which the policies considered by the learning process are constrained by hierarchies of partially specified machines. This allows for the use of prior knowledge to reduce the search space and provides a framework in which knowledge can be transferred across problems and in which component solutions can be recombined to solve larger and more complicated problems. Our approach can be seen as providing a link between reinforcement learning and “behavior-based” or “teleo-reactive” approaches to control. We present provably convergent algorithms for problem-solving and learning with hierarchical machines and demonstrate their effectiveness on a problem with several thousand states.
Ronald Parr, Stuart J. Russell
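
The following is a minimal illustrative sketch, not the authors' algorithm, of the core idea the abstract describes: a partially specified machine restricts which actions a tabular Q-learner may consider in each state, so learning only has to resolve the remaining "choice points." The environment interface (`reset`, `step`) and all class and parameter names are assumptions introduced purely for illustration.

```python
# Illustrative sketch only: constraining tabular Q-learning with a
# partially specified machine. Names and interfaces are hypothetical.

import random
from collections import defaultdict


class PartialMachine:
    """Maps an environment state to the subset of actions it permits.

    States where more than one action is permitted are "choice points"
    whose resolution is left to the learner; states with a single
    permitted action are fully specified by prior knowledge.
    """

    def __init__(self, allowed_actions):
        # allowed_actions: dict mapping state -> non-empty list of actions
        self.allowed_actions = allowed_actions

    def choices(self, state):
        return self.allowed_actions[state]


def constrained_q_learning(env, machine, episodes=500,
                           alpha=0.1, gamma=0.95, epsilon=0.1):
    """Tabular Q-learning restricted to the actions the machine permits.

    Assumes env.reset() returns a state and env.step(action) returns
    (next_state, reward, done), and that the machine permits at least
    one action in every reachable state.
    """
    Q = defaultdict(float)
    for _ in range(episodes):
        state = env.reset()
        done = False
        while not done:
            actions = machine.choices(state)  # prior knowledge as a constraint
            if random.random() < epsilon:
                action = random.choice(actions)
            else:
                action = max(actions, key=lambda a: Q[(state, a)])
            next_state, reward, done = env.step(action)
            if done:
                target = reward
            else:
                next_actions = machine.choices(next_state)
                target = reward + gamma * max(Q[(next_state, a)]
                                              for a in next_actions)
            Q[(state, action)] += alpha * (target - Q[(state, action)])
            state = next_state
    return Q
```

Because the learner's choices are limited to what the machine allows, the effective policy space is smaller than in unconstrained Q-learning; the same machine (or a hierarchy of such machines) can be reused across related problems.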