Generalized model learning for reinforcement learning in factored domains

14 years 6 months ago

Download userweb.cs.utexas.edu

Improving the sample eﬃciency of reinforcement learning algorithms to scale up to larger and more realistic domains is a current research challenge in machine learning. Model-based methods use experiential data more eﬃciently than modelfree approaches but often require exhaustive exploration to learn an accurate model of the domain. We present an algorithm, Reinforcement Learning with Decision Trees (rl-dt), that uses supervised learning techniques to learn the model by generalizing the relative eﬀect of actions across states. Speciﬁcally, rl-dt uses decision trees to model the relative eﬀects of actions in the domain. The agent explores the environment exhaustively in early episodes when its model is inaccurate. Once it believes it has developed an accurate model, it exploits its model, taking the optimal action at each step. The combination of the learning approach with the targeted exploration policy enables fast learning of the model. The sample eﬃciency of the algorit...

Todd Hester, Peter Stone

Real-time Traffic

Artificial Intelligence | ATAL 2009 | Reinforcement Learning | Reinforcement Learning Algorithms | Supervised Learning |

claim paper

Post Info
More Details (n/a)

Added	26 May 2010
Updated	26 May 2010
Type	Conference
Year	2009
Where	ATAL
Authors	Todd Hester, Peter Stone

Comments (0)

Sciweavers

Generalized model learning for reinforcement learning in factored domains

Artificial Intelligence | ATAL 2009 | Reinforcement Learning | Reinforcement Learning Algorithms | Supervised Learning |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers