Search Sciweavers | Sciweavers

141 search results - page 9 / 29

ATAL
2008
Springer

133 views, Intelligent Agents

Transfer of task representation in reinforcement learning using policy-based proto-value functions

15 years 9 months ago

Download www.aamas-conference.org

Reinforcement Learning research is traditionally devoted to solve single-task problems. Therefore, anytime a new task is faced, learning must be restarted from scratch. Recently, ...

Eliseo Ferrante, Alessandro Lazaric, Marcello Rest...

ECML
2004
Springer

154 views, Machine Learning

Experiments in Value Function Approximation with Sparse Support Vector Regression

16 years 14 days ago

Download userweb.cs.utexas.edu

Abstract. We present first experiments using Support Vector Regression as function approximator for an on-line, sarsa-like reinforcement learner. To overcome the batch nature of S...

Tobias Jung, Thomas Uthmann

ATAL
2007
Springer

122 views, Intelligent Agents

Reducing the complexity of multiagent reinforcement learning

16 years 1 month ago

Download www.damas.ift.ulaval.ca

It is known that the complexity of the reinforcement learning algorithms, such as Q-learning, may be exponential in the number of environment’s states. It was shown, however, th...

Andriy Burkov, Brahim Chaib-draa

ECML
2004
Springer

112 views, Machine Learning

Convergence and Divergence in Standard and Averaging Reinforcement Learning

16 years 14 days ago

Download igitur-archive.library.uu.nl

Although tabular reinforcement learning (RL) methods have been proved to converge to an optimal policy, the combination of particular conventional reinforcement learning techniques...

Marco Wiering

GECCO
2004
Springer

122 views, Optimization

Gradient-Based Learning Updates Improve XCS Performance in Multistep Problems

16 years 14 days ago

Download www.cs.york.ac.uk

This paper introduces a gradient-based reward prediction update mechanism to the XCS classifier system as applied in neural-network type learning and function approximation mechani...

Martin V. Butz, David E. Goldberg, Pier Luca Lanzi
