Search Sciweavers | Sciweavers

53 search results - page 10 / 11

» A Polynomial-time Nash Equilibrium Algorithm for Repeated St...

click to vote

ATAL
2010
Springer

175views Intelligent Agents» more ATAL 2010»

Using counterfactual regret minimization to create competitive multiplayer poker agents

13 years 8 months ago

Download webdocs.cs.ualberta.ca

Games are used to evaluate and advance Multiagent and Artificial Intelligence techniques. Most of these games are deterministic with perfect information (e.g. Chess and Checkers)....

Nicholas Abou Risk, Duane Szafron

claim paper

Read More »

click to vote

ATAL
2006
Springer

147views Intelligent Agents» more ATAL 2006»

Learning to cooperate in multi-agent social dilemmas

13 years 11 months ago

Download sequel.futurs.inria.fr

In many Multi-Agent Systems (MAS), agents (even if selfinterested) need to cooperate in order to maximize their own utilities. Most of the multi-agent learning algorithms focus on...

Jose Enrique Munoz de Cote, Alessandro Lazaric, Ma...

claim paper

Read More »

click to vote

ICML
2003
IEEE

156views Machine Learning» more ICML 2003»

AWESOME: A General Multiagent Learning Algorithm that Converges in Self-Play and Learns a Best Response Against Stationary Oppon

14 years 8 months ago

Download www-2.cs.cmu.edu

A satisfactory multiagent learning algorithm should, at a minimum, learn to play optimally against stationary opponents and converge to a Nash equilibrium in self-play. The algori...

Vincent Conitzer, Tuomas Sandholm

claim paper

Read More »

click to vote

MOBIHOC
2007
ACM

150views Computer Networks» more MOBIHOC 2007»

Distributed opportunistic scheduling for ad-hoc communications: an optimal stopping approach

14 years 7 months ago

Download www.public.asu.edu

We consider distributed opportunistic scheduling (DOS) in wireless ad-hoc networks, where many links contend for the same channel using random access. In such networks, distribute...

Dong Zheng, Weiyan Ge, Junshan Zhang

claim paper

Read More »

click to vote

ICML
2001
IEEE

127views Machine Learning» more ICML 2001»

Convergence of Gradient Dynamics with a Variable Learning Rate

14 years 8 months ago

Download www.cs.cmu.edu

As multiagent environments become more prevalent we need to understand how this changes the agent-based paradigm. One aspect that is heavily affected by the presence of multiple a...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

« Prev « First page 10 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers