Learning Near-Pareto-Optimal Conventions in Polynomial Time

15 years 9 months ago

Download www-2.cs.cmu.edu

We study how to learn to play a Pareto-optimal strict Nash equilibrium when there exist multiple equilibria and agents may have different preferences among the equilibria. We focus on repeated coordination games of non-identical interest where agents do not know the game structure up front and receive noisy payoffs. We design efﬁcient near-optimal algorithms for both the perfect monitoring and the imperfect monitoring setting(where the agents only observe their own payoffs and the joint actions).

Xiao Feng Wang, Tuomas Sandholm

Real-time Traffic

Multiple Equilibria | NIPS 2003 | NIPS 2007 | Noisy Payoffs | Strict Nash Equilibrium |

claim paper

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2003
Where	NIPS
Authors	Xiao Feng Wang, Tuomas Sandholm

Comments (0)

Sciweavers

Learning Near-Pareto-Optimal Conventions in Polynomial Time

Multiple Equilibria | NIPS 2003 | NIPS 2007 | Noisy Payoffs | Strict Nash Equilibrium |

Explore & Download

Productivity Tools

Sciweavers