Learning in multi-agent systems is a challenging problem because, from each agent's perspective, the environment is non-stationary. This paper first introduces a new gradient-based learning algorithm that augments the basic gradient ascent approach with policy prediction. We prove that this augmentation results in a stronger notion of convergence than basic gradient ascent: strategies converge to a Nash equilibrium within a restricted class of iterated games. Motivated by this result, we then propose a new practical multi-agent reinforcement learning (MARL) algorithm that exploits approximate policy prediction. Empirical results show that it converges faster and in a wider variety of situations than state-of-the-art MARL algorithms.
Chongjie Zhang, Victor R. Lesser
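To make the idea of gradient ascent with policy prediction concrete, the following is a minimal, hypothetical sketch for a two-player, two-action matrix game: each player forecasts the opponent's next strategy one gradient step ahead and then ascends its own payoff gradient evaluated at that predicted strategy. The payoff matrices (matching pennies), step size eta, prediction length gamma, and the symmetric update shown here are illustrative assumptions, not the paper's exact algorithm or parameter choices.

```python
import numpy as np

# Payoff matrices for a 2x2 game (matching pennies as an illustrative example).
R1 = np.array([[1.0, -1.0], [-1.0, 1.0]])   # row player
R2 = -R1                                     # column player (zero-sum)

def grad_row(alpha, beta):
    """dV1/dalpha for strategies p = (alpha, 1-alpha), q = (beta, 1-beta)."""
    u = R1[0, 0] - R1[0, 1] - R1[1, 0] + R1[1, 1]
    return beta * u + R1[0, 1] - R1[1, 1]

def grad_col(alpha, beta):
    """dV2/dbeta for the column player."""
    u = R2[0, 0] - R2[0, 1] - R2[1, 0] + R2[1, 1]
    return alpha * u + R2[1, 0] - R2[1, 1]

alpha, beta = 0.2, 0.8      # initial probabilities of playing the first action
eta, gamma = 0.001, 0.05    # step size and prediction length (assumed values)

for _ in range(50000):
    # Predict the opponent's short-term policy change via its own gradient ...
    beta_hat = np.clip(beta + gamma * grad_col(alpha, beta), 0.0, 1.0)
    alpha_hat = np.clip(alpha + gamma * grad_row(alpha, beta), 0.0, 1.0)
    # ... then take a gradient-ascent step against the predicted policy.
    alpha = np.clip(alpha + eta * grad_row(alpha, beta_hat), 0.0, 1.0)
    beta = np.clip(beta + eta * grad_col(alpha_hat, beta), 0.0, 1.0)

print(alpha, beta)  # approaches the mixed Nash equilibrium (0.5, 0.5) in this example
```

In this toy game, basic gradient ascent cycles around the mixed equilibrium, whereas the prediction term adds damping that pulls the strategies toward it, which is the kind of strengthened convergence behavior the abstract describes.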