Heuristic Selection of Actions in Multiagent Reinforcement Learning

15 years 8 months ago

Download www.ijcai.org

This work presents a new algorithm, called Heuristically Accelerated Minimax-Q (HAMMQ), that allows the use of heuristics to speed up the wellknown Multiagent Reinforcement Learning algorithm Minimax-Q. A heuristic function H that inﬂuences the choice of the actions characterises the HAMMQ algorithm. This function is associated with a preference policy that indicates that a certain action must be taken instead of another. A set of empirical evaluations were conducted for the proposed algorithm in a simpliﬁed simulator for the robot soccer domain, and experimental results show that even very simple heuristics enhances signiﬁcantly the performance of the multiagent reinforcement learning algorithm.

Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna

Real-time Traffic

Artificial Intelligence | IJCAI 2007 | Multiagent Reinforcement | Reinforcement Learning Algorithm | Wellknown Multiagent Reinforcement |

claim paper

» Asymmetric Multiagent Reinforcement Learning

» Accelerating autonomous learning by using heuristic selection of actions

» Coordinated Reinforcement Learning

» On the Relationship between Learning Capability and the BoltzmannFormula

» Coordination in Multiagent Reinforcement Learning Systems

» SelfOrganizing Cognitive Agents and Reinforcement Learning in MultiAgent Environment

» Collaborative Multiagent Reinforcement Learning by Payoff Propagation

» Integrating Reinforcement Learning Bidding and Genetic Algorithms

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2007
Where	IJCAI
Authors	Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna Helena Reali Costa

Comments (0)

Sciweavers

Heuristic Selection of Actions in Multiagent Reinforcement Learning

Artificial Intelligence | IJCAI 2007 | Multiagent Reinforcement | Reinforcement Learning Algorithm | Wellknown Multiagent Reinforcement |

Explore & Download

Productivity Tools

Sciweavers