Learning to Cooperate via Policy Search



Cooperative games are those in which both agents share the same payoff structure. Value-based reinforcement-learning algorithms, such as variants of Q-learning, have been applied to learning cooperative games, but they only apply when the game state is completely observable to both agents. Policy search methods are a reasonable alternative to value-based methods for partially observable environments. In this paper, we provide a gradient-based distributed policy-search method for cooperative games and compare the notion of local optimum to that of Nash equilibrium. We demonstrate the effectiveness of this method experimentally in a small, partially observable simulated soccer domain.
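The idea of distributed gradient-based policy search in a cooperative game can be illustrated with a toy sketch. This is not the paper's algorithm or its soccer domain: it assumes a stateless two-action game with a hypothetical shared payoff matrix `PAYOFF`, softmax policies, and a plain REINFORCE-style update without a baseline. Each agent adjusts only its own parameters and sees only its own action and the shared reward. All names, the learning rate, and the episode count are illustrative assumptions.

```python
import math
import random

# Hypothetical shared payoff: PAYOFF[a1][a2] is the reward BOTH agents receive.
# Both (0, 0) and (1, 1) are pure Nash equilibria; (0, 0) pays more.
PAYOFF = [[1.0, 0.0],
          [0.0, 0.5]]


def softmax(logits):
    """Convert action preferences (logits) into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample(probs, rng):
    """Draw an action index according to the given probabilities."""
    r = rng.random()
    cum = 0.0
    for action, p in enumerate(probs):
        cum += p
        if r < cum:
            return action
    return len(probs) - 1


def train(episodes=20000, lr=0.1, seed=0):
    """Each agent runs its own stochastic gradient ascent on the shared reward."""
    rng = random.Random(seed)
    thetas = [[0.0, 0.0], [0.0, 0.0]]  # one logit vector per agent
    for _ in range(episodes):
        policies = [softmax(t) for t in thetas]
        actions = [sample(p, rng) for p in policies]
        reward = PAYOFF[actions[0]][actions[1]]
        # Distributed update: an agent uses only its own action and the reward.
        for theta, probs, chosen in zip(thetas, policies, actions):
            for a in range(len(theta)):
                # Gradient of log softmax probability of the chosen action.
                grad_log = (1.0 if a == chosen else 0.0) - probs[a]
                theta[a] += lr * reward * grad_log
    return [softmax(t) for t in thetas]


def is_pure_nash(a1, a2):
    """True if neither agent can raise the shared payoff by deviating alone."""
    r = PAYOFF[a1][a2]
    return (all(PAYOFF[b][a2] <= r for b in range(2))
            and all(PAYOFF[a1][b] <= r for b in range(2)))


policies = train()
greedy = [p.index(max(p)) for p in policies]
print("learned policies:", policies)
print("greedy joint action:", greedy, "pure Nash:", is_pure_nash(*greedy))
```

In this toy game the joint action (1, 1) is a Nash equilibrium even though it pays less than (0, 0), which gives a concrete picture of why a point where neither agent's gradient step helps need not be the best joint policy. Which equilibrium a given run settles on depends on the random seed and the starting parameters.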

Leonid Peshkin, Kee-Eung Kim, Nicolas Meuleau, Leslie Pack Kaelbling


Cooperative Games | Observable Simulated Soccer | UAI 2000 | UAI 2008 | Value-based Reinforcement-learning Algorithms |


» Heterogeneous and Hierarchical Cooperative Learning via Combining Decision Trees

» Cooperative Behavior Acquisition for Mobile Robots in Dynamically Changing Real Worlds Via...

» Bayesian Policy Search for MultiAgent Role Discovery

» Cultivating desired behaviour policy teaching via environment-dynamics tweaks

» Coevolutionary search path planning under constrained information-sharing for a cooperative...

» Improving reinforcement learning function approximators via neuroevolution

» Localizing Search in Reinforcement Learning

» Quadruped Robot Obstacle Negotiation via Reinforcement Learning

Post Info

Added	01 Nov 2010
Updated	01 Nov 2010
Type	Conference
Year	2000
Where	UAI
Authors	Leonid Peshkin, Kee-Eung Kim, Nicolas Meuleau, Leslie Pack Kaelbling
