Dynamic Programming for Partially Observable Stochastic Games

14 years 8 months ago

Download anytime.cs.umass.edu

We develop an exact dynamic programming algorithm for partially observable stochastic games (POSGs). The algorithm is a synthesis of dynamic programming for partially observable Markov decision processes (POMDPs) and iterative elimination of dominated strategies in normal form games. We prove that it iteratively eliminates very weakly dominated strategies without first forming the normal form representation of a finite-horizon POSG. This is the first dynamic programming algorithm for iterative strategy elimination in these types of games. For the special case in which agents share the same payoffs, the algorithm can be used to find an optimal solution. We present preliminary empirical results and discuss ways to further exploit POMDP theory in solving POSGs.

Eric A. Hansen, Daniel S. Bernstein, Shlomo Zilber

Real-time Traffic

AAAI 2004 | Dynamic Programming Algorithm | Intelligent Agents | Normal Form | Observable Stochastic Games |

claim paper

» Game theoretic Golog under partial observability

» Pointbased Dynamic Programming for DECPOMDPs

» Relational Partially Observable MDPs

» Approximate Solutions for Partially Observable Stochastic Games with Common Payoffs

» On the Difficulty of Achieving Equilibrium in Interactive POMDPs

» GameTheoretic Agent Programming in Golog Under Partial Observability

» Stochastic Dynamic Thermal Management A Markovian Decisionbased Approach

» The Dynamics of MultiAgent Reinforcement Learning

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2004
Where	AAAI
Authors	Eric A. Hansen, Daniel S. Bernstein, Shlomo Zilberstein

Comments (0)

Sciweavers

Dynamic Programming for Partially Observable Stochastic Games

AAAI 2004 | Dynamic Programming Algorithm | Intelligent Agents | Normal Form | Observable Stochastic Games |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers