Dynamic correlation matrix based multi-Q learning for a multi-robot system

Multi-robot reinforcement learning is challenging because of large state spaces, difficult reward assignment, nondeterministic action selection, and the difficulty of merging experience learned by other robots. In this paper, we propose a dynamic correlation matrix based multi-Q learning (DCM-MultiQ) method for a distributed multi-robot system. The method introduces a novel dynamic correlation matrix that handles each agent's Q values and also captures the correlation among agents. A theoretical proof of the convergence of the proposed DCM-MultiQ algorithm is provided using feedback matrix control theory. To evaluate the efficiency of the proposed method, several case studies of a multi-robot system performing foraging tasks were conducted. The simulation results demonstrate the efficiency and convergence of the proposed method.
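The abstract does not state the DCM-MultiQ update rule, so the following is only a generic illustration of the two ingredients it names: a per-agent Q table and a matrix that couples agents. The first function is the standard tabular Q-learning update. The second, `mix_q_tables`, and the fixed matrix `corr` are hypothetical stand-ins invented for this sketch; they are not the paper's dynamic correlation matrix, which the abstract says changes over time and is not reproduced here.

```python
def q_update(q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Standard tabular Q-learning update for one agent, applied in place.

    q is a list of rows (one per state), each row a list of action values.
    """
    td_target = r + gamma * max(q[s_next])
    q[s][a] += alpha * (td_target - q[s][a])


def mix_q_tables(q_tables, corr):
    """Hypothetical coupling step (an assumption, not the paper's rule).

    Each agent's new table is a weighted average of all agents' tables,
    with weights taken from the row-normalized matrix corr.
    """
    n_agents = len(q_tables)
    n_states = len(q_tables[0])
    n_actions = len(q_tables[0][0])
    mixed = []
    for i in range(n_agents):
        total = sum(corr[i])
        weights = [c / total for c in corr[i]]
        mixed.append([
            [sum(weights[j] * q_tables[j][s][a] for j in range(n_agents))
             for a in range(n_actions)]
            for s in range(n_states)
        ])
    return mixed


# Two agents, three states, two actions, all Q values starting at zero.
q_tables = [[[0.0, 0.0] for _ in range(3)] for _ in range(2)]

# Agent 0 observes one rewarded transition.
q_update(q_tables[0], s=0, a=1, r=1.0, s_next=2)

# Illustrative fixed coupling matrix (assumed values).
corr = [[1.0, 0.5],
        [0.5, 1.0]]
mixed = mix_q_tables(q_tables, corr)
```

In this toy setup, agent 1 has learned nothing on its own, yet after mixing it holds a nonzero value for the state-action pair that agent 0 experienced, which is the kind of experience sharing the abstract describes as difficult.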

Hongliang Guo, Yan Meng

Distributed Multi-robot System | Dynamic Correlation Matrix | IROS 2008 | Nondeterministic Action Selections | Robotics

Post Info

Added	31 May 2010
Updated	31 May 2010
Type	Conference
Year	2008
Where	IROS
Authors	Hongliang Guo, Yan Meng
