Search Sciweavers | Sciweavers

1863 search results - page 14 / 373

» Multiagent learning using a variable learning rate

193

Voted

ATAL
2007
Springer

181views Intelligent Agents» more ATAL 2007»

Multiagent reinforcement learning and self-organization in a network of agents

16 years 26 days ago

Download mas.cs.umass.edu

To cope with large scale, agents are usually organized in a network such that an agent interacts only with its immediate neighbors in the network. Reinforcement learning technique...

Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

207

Voted

ATAL
2005
Springer

171views Intelligent Agents» more ATAL 2005»

Coordinated exploration in multi-agent reinforcement learning: an application to load-balancing

16 years 7 days ago

Download www.cs.huji.ac.il

This paper is concerned with how multi-agent reinforcement learning algorithms can practically be applied to real-life problems. Recently, a new coordinated multi-agent exploratio...

Katja Verbeeck, Ann Nowé, Karl Tuyls

claim paper

Read More »

165

click to vote

NIPS
2001

121views Information Technology» more NIPS 2001»

Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bias in Reinforcement Learning

15 years 8 months ago

Download books.nips.cc

We address two open theoretical questions in Policy Gradient Reinforcement Learning. The first concerns the efficacy of using function approximation to represent the state action ...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

167

click to vote

COLT
1994
Springer

111views Machine Learning» more COLT 1994»

Learning Probabilistic Automata with Variable Memory Length

15 years 10 months ago

Download www.cs.huji.ac.il

We propose and analyze a distribution learning algorithm for variable memory length Markov processes. These processes can be described by a subclass of probabilistic nite automata...

Dana Ron, Yoram Singer, Naftali Tishby

claim paper

Read More »

181

click to vote

LAMAS
2005
Springer

124views Intelligent Agents» more LAMAS 2005»

Unifying Convergence and No-Regret in Multiagent Learning

16 years 5 days ago

Download orca.st.usm.edu

We present a new multiagent learning algorithm, RVσ(t), that builds on an earlier version, ReDVaLeR . ReDVaLeR could guarantee (a) convergence to best response against stationary ...

Bikramjit Banerjee, Jing Peng

claim paper

Read More »

« Prev « First page 14 / 373 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers