Search Sciweavers | Sciweavers

56 search results - page 5 / 12

» Multi-Agent Systems by Incremental Gradient Reinforcement Le...

130

click to vote

ATAL
2008
Springer

115views Intelligent Agents» more ATAL 2008»

Switching dynamics of multi-agent learning

15 years 5 months ago

Download www.ifaamas.org

This paper presents the dynamics of multi-agent reinforcement learning in multiple state problems. We extend previous work that formally modelled the relation between reinforcemen...

Peter Vrancx, Karl Tuyls, Ronald L. Westra

claim paper

Read More »

133

click to vote

ICML
1995
IEEE

184views Machine Learning» more ICML 1995»

Residual Algorithms: Reinforcement Learning with Function Approximation

16 years 4 months ago

Download www.leemon.com

A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...

Leemon C. Baird III

claim paper

Read More »

107

click to vote

UAI
2001

129views Artificial Intelligence» more UAI 2001»

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

15 years 4 months ago

Download cs.anu.edu.au

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...

Lex Weaver, Nigel Tao

claim paper

Read More »

124

Voted

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 4 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

119

click to vote

CORR
2000
Springer

92views Education» more CORR 2000»

Predicting the expected behavior of agents that learn about agents: the CLRI framework

15 years 3 months ago

Download jmvidal.cse.sc.edu

We describe a framework and equations used to model and predict the behavior of multi-agent systems (MASs) with learning agents. A difference equation is used for calculating the ...

José M. Vidal, Edmund H. Durfee

claim paper

Read More »

« Prev « First page 5 / 12 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers