Sciweavers

64 search results - page 8 / 13
» Learning to commit in repeated games
Sort
View
ICML
2001
IEEE
14 years 7 months ago
Convergence of Gradient Dynamics with a Variable Learning Rate
As multiagent environments become more prevalent we need to understand how this changes the agent-based paradigm. One aspect that is heavily affected by the presence of multiple a...
Michael H. Bowling, Manuela M. Veloso
AAAI
2008
13 years 9 months ago
Perpetual Learning for Non-Cooperative Multiple Agents
This paper examines, by argument, the dynamics of sequences of behavioural choices made, when non-cooperative restricted-memory agents learn in partially observable stochastic gam...
Luke Dickens
WIAS
2010
109views more  WIAS 2010»
13 years 1 months ago
Model identification in interactive influence diagrams using mutual information
Modeling the perceived behaviors of other agents improves the performance of an agent in multiagent interactions. We utilize the language of interactive influence diagrams to mode...
Yifeng Zeng, Prashant Doshi
ICML
2003
IEEE
14 years 7 months ago
AWESOME: A General Multiagent Learning Algorithm that Converges in Self-Play and Learns a Best Response Against Stationary Oppon
A satisfactory multiagent learning algorithm should, at a minimum, learn to play optimally against stationary opponents and converge to a Nash equilibrium in self-play. The algori...
Vincent Conitzer, Tuomas Sandholm
ACL
2009
13 years 4 months ago
Reinforcement Learning for Mapping Instructions to Actions
In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function tha...
S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer,...