Policy Gradient Method for Team Markov Games

14 years 6 months ago

Download www.cis.hut.fi

The main aim of this paper is to extend the single-agent policy gradient method for multiagent domains where all agents share the same utility function. We formulate these team problems as Markov games endowed with the asymmetric equilibrium concept and based on this formulation, we provide a direct policy gradient learning method. In addition, we test the proposed method with a small example problem.

Ville Könönen

Real-time Traffic

Asymmetric Equilibrium Concept | Gradient Learning Method | IDEAL 2004 | Policy Gradient Method |

claim paper

Post Info
More Details (n/a)

Added	02 Jul 2010
Updated	02 Jul 2010
Type	Conference
Year	2004
Where	IDEAL
Authors	Ville Könönen

Comments (0)

Sciweavers

Policy Gradient Method for Team Markov Games

Asymmetric Equilibrium Concept | Gradient Learning Method | IDEAL 2004 | Policy Gradient Method |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers