In non-cooperative multi-agent settings, no learning algorithm can be both globally optimal and opponent-independent. Regret minimization over a set of strategies, each optimized against a potential opponent model, is proposed as a principled framework for deciding how to behave in such settings. By allowing longer playing horizons and experts that learn as they play, the regret-minimization framework can be extended to overcome several shortcomings of earlier approaches to multi-agent learning.
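As a concrete illustration of the idea, the sketch below runs a standard multiplicative-weights (Hedge) update over a small set of expert strategies, one of which learns from the opponent's history as it plays. The expert set, the biased opponent, the matching-pennies-style payoff, and the learning rate `eta` are all illustrative assumptions for this sketch, not the construction proposed here.

```python
import math
import random

def hedge_over_experts(experts, opponent, rounds, eta=0.2):
    """Regret minimization (Hedge / exponential weights) over a set of
    expert strategies, each of which may itself learn from the history."""
    weights = [1.0] * len(experts)
    history = []  # opponent's past moves, visible to learning experts
    for _ in range(rounds):
        total = sum(weights)
        probs = [w / total for w in weights]
        # Each expert proposes an action given the history so far.
        actions = [expert(history) for expert in experts]
        # The agent actually plays the action of an expert sampled in
        # proportion to its current weight.
        played = random.choices(actions, weights=probs)[0]
        opp_move = opponent(history)
        # Reward 1 for matching the opponent's move, 0 otherwise; with
        # full feedback, every expert's counterfactual reward is observable,
        # so all weights are updated each round.
        rewards = [1.0 if a == opp_move else 0.0 for a in actions]
        weights = [w * math.exp(eta * r) for w, r in zip(weights, rewards)]
        history.append(opp_move)
    return [w / sum(weights) for w in weights]

# Illustrative experts: fixed responses to two assumed opponent models,
# plus one expert that learns (tracks the opponent's empirical frequency).
always_heads = lambda h: "H"
always_tails = lambda h: "T"
frequency_tracker = lambda h: "H" if h.count("H") >= len(h) / 2 else "T"

# Assumed opponent model: plays heads 70% of the time.
biased_opponent = lambda h: "H" if random.random() < 0.7 else "T"

final_weights = hedge_over_experts(
    [always_heads, always_tails, frequency_tracker], biased_opponent, 1000
)
print(final_weights)  # weight concentrates on experts that fit this opponent
```

Over a long enough horizon, the weight concentrates on whichever expert performs best against the actual opponent, which is why regret minimization over opponent-model-based experts hedges against uncertainty about which opponent is being faced.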