Search Sciweavers | Sciweavers

64 search results - page 8 / 13

» Learning to commit in repeated games

158

click to vote

ICML
2001
IEEE

127views Machine Learning» more ICML 2001»

Convergence of Gradient Dynamics with a Variable Learning Rate

16 years 7 months ago

Download www.cs.cmu.edu

As multiagent environments become more prevalent we need to understand how this changes the agent-based paradigm. One aspect that is heavily affected by the presence of multiple a...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

185

click to vote

AAAI
2008

169views Intelligent Agents» more AAAI 2008»

Perpetual Learning for Non-Cooperative Multiple Agents

15 years 9 months ago

Download www.aaai.org

This paper examines, by argument, the dynamics of sequences of behavioural choices made, when non-cooperative restricted-memory agents learn in partially observable stochastic gam...

Luke Dickens

claim paper

Read More »

165

click to vote

WIAS
2010

109views more WIAS 2010»

Model identification in interactive influence diagrams using mutual information

15 years 1 months ago

Download www.cs.uga.edu

Modeling the perceived behaviors of other agents improves the performance of an agent in multiagent interactions. We utilize the language of interactive influence diagrams to mode...

Yifeng Zeng, Prashant Doshi

claim paper

Read More »

145

click to vote

ICML
2003
IEEE

156views Machine Learning» more ICML 2003»

AWESOME: A General Multiagent Learning Algorithm that Converges in Self-Play and Learns a Best Response Against Stationary Oppon

16 years 7 months ago

Download www-2.cs.cmu.edu

A satisfactory multiagent learning algorithm should, at a minimum, learn to play optimally against stationary opponents and converge to a Nash equilibrium in self-play. The algori...

Vincent Conitzer, Tuomas Sandholm

claim paper

Read More »

167

click to vote

ACL
2009

123views Computational Linguistics» more ACL 2009»

Reinforcement Learning for Mapping Instructions to Actions

15 years 4 months ago

Download www.aclweb.org

In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function tha...

S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer,...

claim paper

Read More »

« Prev « First page 8 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers