Search Sciweavers | Sciweavers

226 search results - page 6 / 46

» A Convergent Reinforcement Learning Algorithm in the Continu...

click to vote

NIPS
1998

140views Information Technology» more NIPS 1998»

Gradient Descent for General Reinforcement Learning

13 years 8 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

claim paper

Read More »

click to vote

CIA
2007
Springer

143views Intelligent Agents» more CIA 2007»

Multi-agent Learning Dynamics: A Survey

14 years 27 days ago

Download michaelkaisers.com

Abstract. In this paper we compare state-of-the-art multi-agent reinforcement learning algorithms in a wide variety of games. We consider two types of algorithms: value iteration a...

H. Jaap van den Herik, Daniel Hennes, Michael Kais...

claim paper

Read More »

click to vote

NIPS
1997

94views Information Technology» more NIPS 1997»

Reinforcement Learning with Hierarchies of Machines

13 years 8 months ago

Download www.cs.berkeley.edu

We present a new approach to reinforcement learning in which the policies considered by the learning process are constrained by hierarchies of partially speciﬁed machines. This ...

Ronald Parr, Stuart J. Russell

claim paper

Read More »

click to vote

CEC
2005
IEEE

98views Artificial Intelligence» more CEC 2005»

XCS with computed prediction in continuous multistep environments

13 years 8 months ago

Download www.eskimo.com

We apply XCS with computed prediction (XCSF) to tackle multistep reinforcement learning problems involving continuous inputs. In essence we use XCSF as a method of generalized rein...

Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wils...

claim paper

Read More »

click to vote

IAT
2007
IEEE

92views Intelligent Agents» more IAT 2007»

Noise Tolerance in Reinforcement Learning Algorithms

14 years 1 months ago

Download www.ppgia.pucpr.br

This paper proposes a mechanism of noise tolerance for reinforcement learning algorithms. An adaptive agent that employs reinforcement learning algorithms may receive and accumula...

Richardson Ribeiro, Alessandro L. Koerich, Fabr&ia...

claim paper

Read More »

« Prev « First page 6 / 46 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers