Search Sciweavers | Sciweavers

272 search results - page 7 / 55

» Parallel Reinforcement Learning with Linear Function Approxi...

218

Voted

IWLCS
2005
Springer

161views Machine Learning» more IWLCS 2005»

Counter Example for Q-Bucket-Brigade Under Prediction Problem

16 years 4 days ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classiﬁer System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

claim paper

Read More »

147

click to vote

ESANN
2007

125views Neural Networks» more ESANN 2007»

Replacing eligibility trace for action-value learning with function approximation

15 years 8 months ago

Download www.dice.ucl.ac.be

The eligibility trace is one of the most used mechanisms to speed up reinforcement learning. Earlier reported experiments seem to indicate that replacing eligibility traces would p...

Kary Främling

claim paper

Read More »

208

click to vote

AAAI
2006

161views Intelligent Agents» more AAAI 2006»

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

15 years 8 months ago

Download staff.science.uva.nl

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...

Shimon Whiteson, Peter Stone

claim paper

Read More »

176

click to vote

IWANN
1999
Springer

115views Neural Networks» more IWANN 1999»

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

15 years 11 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

claim paper

Read More »

174

click to vote

ATAL
2007
Springer

162views Intelligent Agents» more ATAL 2007»

Model-based function approximation in reinforcement learning

16 years 25 days ago

Download userweb.cs.utexas.edu

Reinforcement learning promises a generic method for adapting agents to arbitrary tasks in arbitrary stochastic environments, but applying it to new real-world problems remains di...

Nicholas K. Jong, Peter Stone

claim paper

Read More »

« Prev « First page 7 / 55 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers