Smoothed Sarsa: Reinforcement learning for robot delivery tasks



Our goal in this work is to make high-level decisions for mobile robots. In particular, given a queue of prioritized object delivery tasks, we wish to find a sequence of actions in real time to accomplish these tasks efficiently. We introduce a novel reinforcement learning algorithm called Smoothed Sarsa that learns a good policy for these delivery tasks by delaying the backup reinforcement step until the uncertainty in the state estimate improves. The state space is modeled by a Dynamic Bayesian Network and updated using a Region-based Particle Filter. We take advantage of the fact that only discrete (topological) representations of entity locations are needed for decision-making, to make the tracking and decision-making more efficient. Our experiments show that policy search leads to faster task completion times as well as higher total reward compared to a manually crafted policy. Smoothed Sarsa learns a policy orders of magnitude faster than previous policy search algorithms....
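The abstract gives only the core idea of Smoothed Sarsa: hold back the Sarsa backup until the state estimate is less uncertain. The following is a minimal sketch of that idea, not the authors' implementation. The class name `DelayedSarsa`, the entropy threshold `max_entropy`, the use of the most likely discrete state for the update, and the region and action names are all assumptions made for illustration; the paper may measure uncertainty and perform the backup differently.

```python
import math
from collections import defaultdict, deque


def entropy(belief):
    """Shannon entropy (in nats) of a dict mapping discrete state -> probability."""
    return -sum(p * math.log(p) for p in belief.values() if p > 0.0)


class DelayedSarsa:
    """Tabular Sarsa whose backups wait for a confident state estimate (illustrative)."""

    def __init__(self, alpha=0.5, gamma=0.9, max_entropy=0.3):
        self.q = defaultdict(float)   # Q-values keyed by (state, action)
        self.alpha = alpha            # learning rate
        self.gamma = gamma            # discount factor
        self.max_entropy = max_entropy  # assumed confidence threshold
        self.pending = deque()        # transitions whose backup is deferred

    def observe(self, t, action, reward, t_next, next_action):
        """Queue a transition; t and t_next index the belief history."""
        self.pending.append((t, action, reward, t_next, next_action))

    def backup(self, smoothed):
        """Apply queued backups, oldest first, while beliefs are confident enough.

        smoothed maps a time step to a belief dict (state -> probability),
        e.g. the output of a smoothing pass over a filter's history.
        Returns the number of backups applied.
        """
        applied = 0
        while self.pending:
            t, a, r, t_next, a_next = self.pending[0]
            b, b_next = smoothed[t], smoothed[t_next]
            if max(entropy(b), entropy(b_next)) > self.max_entropy:
                break  # still too uncertain: keep the transition queued
            s = max(b, key=b.get)            # most likely state (assumption)
            s_next = max(b_next, key=b_next.get)
            td_error = r + self.gamma * self.q[(s_next, a_next)] - self.q[(s, a)]
            self.q[(s, a)] += self.alpha * td_error
            self.pending.popleft()
            applied += 1
        return applied


# Usage with hypothetical topological regions and actions.
agent = DelayedSarsa(alpha=0.5, gamma=0.9, max_entropy=0.3)
agent.observe(0, "goto_kitchen", 1.0, 1, "pickup")

uncertain = {0: {"hall": 0.5, "kitchen": 0.5}, 1: {"kitchen": 1.0}}
n_early = agent.backup(uncertain)   # belief at step 0 is ambiguous: deferred

confident = {0: {"hall": 0.98, "kitchen": 0.02}, 1: {"kitchen": 1.0}}
n_late = agent.backup(confident)    # belief has sharpened: backup is applied
```

Processing the queue strictly in order keeps the updates in the same sequence as the experience, at the cost of one ambiguous step blocking later ones; a variant could skip ahead instead.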

Deepak Ramachandran, Rakesh Gupta


Delivery Tasks | ICRA 2009 | Object Delivery Tasks | Policy Search | Robotics


Post Info

Added	23 May 2010
Updated	23 May 2010
Type	Conference
Year	2009
Where	ICRA
Authors	Deepak Ramachandran, Rakesh Gupta
