Search Sciweavers | Sciweavers

272 search results - page 6 / 55

» Parallel Reinforcement Learning with Linear Function Approxi...

ESANN 2006, Neural Networks, 114 views

Reducing policy degradation in neuro-dynamic programming

15 years 8 months ago

Download ml.informatik.uni-freiburg.de

We focus on neuro-dynamic programming methods to learn state-action value functions and outline some of the inherent problems to be faced when performing reinforcement learning in...

Thomas Gabel, Martin Riedmiller

ICRA 2008 (IEEE), Robotics, 113 views

Reinforcement learning with function approximation for cooperative navigation tasks

16 years 1 month ago

Download gaips.inesc-id.pt

In this paper, we propose a reinforcement learning approach to address multi-robot cooperative navigation tasks in infinite settings. We propose an algorithm to simultaneously...

Francisco S. Melo, M. Isabel Ribeiro

ICML 1996 (IEEE), Machine Learning, 196 views

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

15 years 10 months ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

ATAL 2005 (Springer), Intelligent Agents, 181 views

Improving reinforcement learning function approximators via neuroevolution

16 years 6 days ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

ICCBR 2005 (Springer), Automated Reasoning, 210 views

CBR for State Value Function Approximation in Reinforcement Learning

16 years 5 days ago

Download ml.informatik.uni-freiburg.de

CBR is one of the techniques that can be applied to the task of approximating a function over high-dimensional, continuous spaces. In Reinforcement Learning systems a learning agen...

Thomas Gabel, Martin A. Riedmiller
