Search Sciweavers | Sciweavers

25 search results - page 2 / 5

» Learning by demonstration in repeated stochastic games

AI
2007
Springer

Competition and Coordination in Stochastic Games

14 years 1 month ago

Download www.damas.ift.ulaval.ca

Agent competition and coordination are two classical and most important tasks in multiagent systems. In recent years, a number of learning algorithms have been proposed to resolve ...

Andriy Burkov, Abdeslam Boularias, Brahim Chaib-dr...

ICML
2010
IEEE

Convergence, Targeted Optimality, and Safety in Multiagent Learning

13 years 8 months ago

Download www.cs.utexas.edu

This paper introduces a novel multiagent learning algorithm, Convergence with Model Learning and Safety (or CMLeS in short), which achieves convergence, targeted optimality agains...

Doran Chakraborty, Peter Stone

IJCAI
2001

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

13 years 9 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

ICML
2008
IEEE

Strategy evaluation in extensive games with importance sampling

14 years 8 months ago

Download www.cs.ualberta.ca

Typically agent evaluation is done through Monte Carlo estimation. However, stochastic agent decisions and stochastic outcomes can make this approach inefficient, requiring many s...

Michael H. Bowling, Michael Johanson, Neil Burch, ...

AI
2002
Springer

Multiagent learning using a variable learning rate

13 years 7 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso
