On the Difficulty of Achieving Equilibrium in Interactive POMDPs

14 years 3 months ago

Download www.cs.uic.edu

We analyze the asymptotic behavior of agents engaged in an infinite horizon partially observable stochastic game as formalized by the interactive POMDP framework. We show that when agents' initial beliefs satisfy a truth compatibility condition, their behavior converges to a subjective -equilibrium in a finite time, and subjective equilibrium in the limit. This result is a generalization of a similar result in repeated games, to partially observable stochastic games. However, it turns out that the equilibrating process is difficult to demonstrate computationally because of the difficulty in coming up with initial beliefs that are both natural and satisfy the truth compatibility condition. Our results, therefore, shed some negative light on using equilibria as a solution concept for decision making in partially observable stochastic games.

Prashant Doshi, Piotr J. Gmytrasiewicz

Real-time Traffic

AAAI 2006 | Initial Beliefs | Intelligent Agents | Observable Stochastic Games | Truth Compatibility Condition |

claim paper

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2006
Where	AAAI
Authors	Prashant Doshi, Piotr J. Gmytrasiewicz

Comments (0)

Sciweavers

On the Difficulty of Achieving Equilibrium in Interactive POMDPs

AAAI 2006 | Initial Beliefs | Intelligent Agents | Observable Stochastic Games | Truth Compatibility Condition |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers