Sciweavers

IDEAL
2004
Springer

Policy Gradient Method for Team Markov Games

14 years 4 months ago
Policy Gradient Method for Team Markov Games
The main aim of this paper is to extend the single-agent policy gradient method for multiagent domains where all agents share the same utility function. We formulate these team problems as Markov games endowed with the asymmetric equilibrium concept and based on this formulation, we provide a direct policy gradient learning method. In addition, we test the proposed method with a small example problem.
Ville Könönen
Added 02 Jul 2010
Updated 02 Jul 2010
Type Conference
Year 2004
Where IDEAL
Authors Ville Könönen
Comments (0)