Search Sciweavers | Sciweavers

» Residual Algorithms: Reinforcement Learning with Function Ap...

SBIA
2004
Springer

137 views, Artificial Intelligence

Heuristically Accelerated Q-Learning: A New Approach to Speed Up Reinforcement Learning

14 years 1 month ago

Download www.fei.edu.br

This work presents a new algorithm, called Heuristically Accelerated Q-Learning (HAQL), that allows the use of heuristics to speed up the well-known Reinforcement Learning algori...

Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...

TSMC
2008

132 views

Ensemble Algorithms in Reinforcement Learning

13 years 7 months ago

Download people.cs.uu.nl

This paper describes several ensemble methods that combine multiple different reinforcement learning (RL) algorithms in a single agent. The aim is to enhance learning speed and fin...

Marco A. Wiering, Hado van Hasselt

NIPS
2008

130 views, Information Technology

Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation

13 years 9 months ago

Download eprints.pascal-network.org

Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g.,...

Dotan Di Castro, Dmitry Volkinshtein, Ron Meir

GECCO
2009
Springer

82 views, Optimization

On the scalability of XCS(F)

14 years 2 months ago

Download www.coboslab.psychologie.uni-wuerzburg.de

Many successful applications have proven the potential of Learning Classifier Systems and the XCS classifier system in particular in datamining, reinforcement learning, and func...

Patrick O. Stalph, Martin V. Butz, David E. Goldbe...

NIPS
2003

148 views, Information Technology

Approximate Planning in POMDPs with Macro-Actions

13 years 9 months ago

Download books.nips.cc

Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...

Georgios Theocharous, Leslie Pack Kaelbling
