Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

104

PKDD
2009
Springer

favoriteEmaildiscussreport

144views Data Mining» more PKDD 2009»

Compositional Models for Reinforcement Learning

15 years 8 months ago

Compositional Models for Reinforcement Learning

Download userweb.cs.utexas.edu

Abstract. Innovations such as optimistic exploration, function approximation, and hierarchical decomposition have helped scale reinforcement learning to more complex environments, but these three ideas have rarely been studied together. This paper develops a uniﬁed framework that formalizes these algorithmic contributions as operators on learned models of the environment. Our formalism reveals some synergies among these innovations, and it suggests a straightforward way to compose them. The resulting algorithm, Fitted R-MAXQ, is the ﬁrst to combine the function approximation of ﬁtted algorithms, the eﬃcient model-based exploration of R-MAX, and the hierarchical decompostion of MAXQ.

Nicholas K. Jong, Peter Stone

Real-time Traffic

Data Mining | Function Approximation | Hierarchical Decomposition | Optimistic Exploration | PKDD 2009 |

claim paper

Related Content

» MultipleGoal Reinforcement Learning with Modular Sarsa0

» Learning to Drive a Bicycle Using Reinforcement Learning and Shaping

» Reinforcement learning agents with primary knowledge designed by analytic hierarchy proces...

» Fast Learning in an ActorCritic Architecture with Reward and Punishment

» Multiple ModelBased Reinforcement Learning

» A Modular QLearning Architecture for Manipulator Task Decomposition

» Training Reinforcement Neurocontrollers Using the Polytope Algorithm

» Combining ModelBased MetaReasoning and Reinforcement Learning for Adapting GamePlaying Age...

» Learning from Reinforcement and Advice Using Composite Reward Functions

Post Info
More Details (n/a)

Added	27 May 2010
Updated	27 May 2010
Type	Conference
Year	2009
Where	PKDD
Authors	Nicholas K. Jong, Peter Stone

Comments (0)