Multiple-Goal Reinforcement Learning with Modular Sarsa(0)



We present a new algorithm, GM-Sarsa(0), for finding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processes (MDPs). In our formulation, different sub-goals are modeled as MDPs that are coupled by the requirement that they share actions. Existing reinforcement learning algorithms address similar problem formulations by first finding optimal policies for the component MDPs, and then merging these into a policy for the composite task. The problem with such methods is that policies that are optimized separately may or may not perform well when they are merged into a composite solution. Instead of searching for optimal policies for the component MDPs in isolation, our approach finds good policies in the context of the composite task.
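The idea of learning component values "in the context of the composite task" can be illustrated with a small sketch. The code below is not taken from the paper. It assumes that each sub-goal module keeps its own tabular Q-function, that the shared action is chosen by maximizing the sum of the modules' Q-values, and that every module then applies an ordinary Sarsa(0) update using the action the composite policy actually took. The class name `GMSarsa0`, its method names, and the default values for `alpha`, `gamma`, and `epsilon` are all hypothetical choices made for illustration.

```python
import random
from collections import defaultdict


class GMSarsa0:
    """Illustrative sketch only: one tabular Q-function per sub-goal module,
    with all modules sharing a single action set (names are hypothetical)."""

    def __init__(self, n_modules, actions, alpha=0.1, gamma=0.9,
                 epsilon=0.1, seed=0):
        self.actions = list(actions)
        self.alpha = alpha      # learning rate (assumed value)
        self.gamma = gamma      # discount factor (assumed value)
        self.epsilon = epsilon  # exploration rate (assumed value)
        self.rng = random.Random(seed)
        # q[i][(state_i, action)] is module i's value estimate.
        self.q = [defaultdict(float) for _ in range(n_modules)]

    def summed_q(self, states, action):
        """Sum the modules' Q-values for one shared action."""
        return sum(q[(s, action)] for q, s in zip(self.q, states))

    def select_action(self, states):
        """Epsilon-greedy choice over the summed Q-values."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.summed_q(states, a))

    def update(self, states, action, rewards, next_states, next_action,
               done=False):
        """Sarsa(0) update for every module, using the action that the
        composite policy actually selects in the next step."""
        for q, s, r, s2 in zip(self.q, states, rewards, next_states):
            target = r if done else r + self.gamma * q[(s2, next_action)]
            q[(s, action)] += self.alpha * (target - q[(s, action)])
```

In use, each step passes one state and one reward per module: call `select_action(states)`, apply the action in the environment, call `select_action(next_states)` to get `next_action`, then call `update(...)`. Because the update is on-policy, each module's estimates reflect how the composite policy behaves rather than what that module would do if it acted alone.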

Nathan Sprague, Dana H. Ballard


Composite Task | IJCAI 2003 | IJCAI 2007 | Optimal Policies | Reinforcement Learning |


Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2003
Where	IJCAI
Authors	Nathan Sprague, Dana H. Ballard
