Search Sciweavers | Sciweavers

23

AAAI
2008

169views Intelligent Agents» more AAAI 2008»

Perpetual Learning for Non-Cooperative Multiple Agents

13 years 9 months ago

This paper examines, by argument, the dynamics of sequences of behavioural choices made, when non-cooperative restricted-memory agents learn in partially observable stochastic gam...

Luke Dickens

claim paper

Read More »

18

click to vote

AIIDE
2008

146views Artificial Intelligence» more AIIDE 2008»

Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games

13 years 9 months ago

Download www.aaai.org

We introduce the ALeRT (Action-dependent Learning Rates with Trends) algorithm that makes two modifications to the learning rate and one change to the exploration rate of traditio...

Maria Cutumisu, Duane Szafron, Michael H. Bowling,...

claim paper

Read More »

21

click to vote

GECCO
2009
Springer

124views Optimization» more GECCO 2009»

Reinforcement learning for games: failures and successes

14 years 2 days ago

Download www.gm.fh-koeln.de

We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...

Wolfgang Konen, Thomas Bartz-Beielstein

claim paper

Read More »

26

click to vote

AAMAS
2010
Springer

158views Intelligent Agents» more AAMAS 2010»

Coordinated learning in multiagent MDPs with infinite state-space

13 years 7 months ago

Download gaips.inesc-id.pt

Abstract In this paper we address the problem of simultaneous learning and coordination in multiagent Markov decision problems (MMDPs) with infinite state-spaces. We separate this ...

Francisco S. Melo, M. Isabel Ribeiro

claim paper

Read More »

21

click to vote

LAMAS
2005
Springer

124views Intelligent Agents» more LAMAS 2005»

Unifying Convergence and No-Regret in Multiagent Learning

14 years 28 days ago

Download orca.st.usm.edu

We present a new multiagent learning algorithm, RVσ(t), that builds on an earlier version, ReDVaLeR . ReDVaLeR could guarantee (a) convergence to best response against stationary ...

Bikramjit Banerjee, Jing Peng

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers