The Necessity of Average Rewards in Cooperative Multirobot Learning

Download www.ri.cmu.edu

Learning can be an effective way for robot systems to deal with dynamic environments and changing task conditions. However, popular single-robot learning algorithms based on discounted rewards, such as Q-learning, do not achieve cooperation (i.e., purposeful division of labor) when applied to task-level multirobot systems. A task-level system is defined as one performing a mission that is decomposed into subtasks shared among robots. In this paper, we demonstrate the superiority of average-reward-based learning, such as the Monte Carlo algorithm, for task-level multirobot systems, and suggest an explanation for this superiority.
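The distinction between discounted and average rewards can be illustrated with a minimal sketch. This is not the authors' experiment or their explanation; the reward sequences, the discount factor `gamma`, and the function names below are hypothetical, chosen only to show how the two criteria can rank the same behaviors differently.

```python
def discounted_return(rewards, gamma):
    """Sum of gamma**t * r_t: the quantity discounted methods such as Q-learning estimate."""
    return sum((gamma ** t) * r for t, r in enumerate(rewards))


def average_reward(rewards):
    """Mean reward per step over the whole episode."""
    return sum(rewards) / len(rewards)


# Hypothetical reward sequences (assumptions, not data from the paper):
# 'greedy' takes a small reward immediately, 'patient' waits for a larger delayed one.
greedy = [1.0, 0.0, 0.0, 0.0, 0.0, 0.0]
patient = [0.0, 0.0, 0.0, 0.0, 0.0, 3.0]

gamma = 0.7  # assumed discount factor

for name, seq in (("greedy", greedy), ("patient", patient)):
    print(name, discounted_return(seq, gamma), average_reward(seq))
```

Under these assumed numbers the discounted criterion ranks the immediate small reward higher, while the average criterion ranks the delayed larger reward higher. In a task-level setting, a reward that depends on several robots finishing their subtasks typically arrives late, which is one plausible reason the choice of criterion matters.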

Poj Tangamchit, John M. Dolan, Pradeep K. Khosla

ICRA 2002 | Monte Carlo Algorithm | Popular Singlerobot Learning | Robotics | Task-level Multirobot Systems

Added	15 Jul 2010
Updated	15 Jul 2010
Type	Conference
Year	2002
Where	ICRA
Authors	Poj Tangamchit, John M. Dolan, Pradeep K. Khosla
