Search Sciweavers | Sciweavers

» Reinforcement learning

ECML
2006
Springer

Machine Learning

Reinforcement Learning for MDPs with Constraints

Download www.peter-geibel.de

In this article, I will consider Markov Decision Processes with two criteria, each defined as the expected value of an infinite horizon cumulative return. The second criterion is e...

Peter Geibel

AI
2002
Elsevier

Artificial Intelligence

Multiagent learning using a variable learning rate

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso

ICML
1995
Morgan Kaufmann

Machine Learning

Residual Algorithms: Reinforcement Learning with Function Approximation

Download www.leemon.com

A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...

Leemon C. Baird III

AAMAS
2007
Springer

Intelligent Agents

Bifurcation Analysis of Reinforcement Learning Agents in the Selten's Horse Game

Download sequel.futurs.inria.fr

The application of reinforcement learning algorithms to multiagent domains may cause complex non-convergent dynamics. The replicator dynamics, commonly used in evolutiona...

Alessandro Lazaric, Jose Enrique Munoz de Cote, Fa...

TSMC
2008

A Comprehensive Survey of Multiagent Reinforcement Learning

Download www.dcsc.tudelft.nl

Multiagent systems are rapidly finding applications in a variety of domains, including robotics, distributed control, telecommunications, and economics. The complexity of many task...

Lucian Busoniu, Robert Babuska, Bart De Schutter
