Heuristic Selection of Actions in Multiagent Reinforcement Learning



This work presents a new algorithm, called Heuristically Accelerated Minimax-Q (HAMMQ), that allows the use of heuristics to speed up the well-known Multiagent Reinforcement Learning algorithm Minimax-Q. The HAMMQ algorithm is characterised by a heuristic function H that influences the choice of actions. This function is associated with a preference policy that indicates that a certain action must be taken instead of another. A set of empirical evaluations was conducted for the proposed algorithm in a simplified simulator for the robot soccer domain, and experimental results show that even very simple heuristics significantly enhance the performance of the multiagent reinforcement learning algorithm.
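The abstract does not give the action-selection rule itself, so the following is only a minimal sketch of how a heuristic function H could bias action choice in a minimax-style learner. It assumes a deterministic maximin over the opponent's actions, an additive bias weighted by a hypothetical parameter xi, and epsilon-greedy exploration; the function name and the dictionary layout of Q and H are illustrative, not taken from the paper. Minimax-Q proper computes a mixed policy via linear programming, which this sketch leaves out.

```python
import random


def heuristic_action(Q, H, state, actions, opp_actions,
                     xi=1.0, epsilon=0.1, rng=random):
    """Choose an action using a heuristically biased maximin rule.

    Q and H are dicts keyed by (state, action, opponent_action);
    missing entries count as 0.0. All names here are assumptions.
    """
    # Explore: with probability epsilon, pick a uniformly random action.
    if rng.random() < epsilon:
        return rng.choice(actions)

    def worst_case(a):
        # Value of action a against the opponent's most damaging reply,
        # with the heuristic added as a bias term weighted by xi.
        return min(
            Q.get((state, a, o), 0.0) + xi * H.get((state, a, o), 0.0)
            for o in opp_actions
        )

    # Exploit: take the action with the best biased worst-case value.
    return max(actions, key=worst_case)


if __name__ == "__main__":
    acts, opp = ["forward", "back"], ["block", "wait"]
    # Hypothetical heuristic: prefer moving forward in state "s0".
    H = {("s0", "forward", o): 1.0 for o in opp}
    print(heuristic_action({}, H, "s0", acts, opp, epsilon=0.0))
```

With an empty Q table the heuristic alone decides the action, which is the situation early in learning where a bias of this kind would be expected to help most; as Q estimates grow, they can outweigh the heuristic term.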

Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna Helena Reali Costa


Artificial Intelligence | IJCAI 2007 | Multiagent Reinforcement | Reinforcement Learning Algorithm | Wellknown Multiagent Reinforcement |


Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2007
Where	IJCAI
Authors	Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna Helena Reali Costa
