Search Sciweavers | Sciweavers

2108 search results - page 144 / 422

» Tracking in Reinforcement Learning

ICML
2003
IEEE

151 views, Machine Learning

Hierarchical Policy Gradient Algorithms

16 years 3 months ago

Download www.hpl.hp.com

Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...

Mohammad Ghavamzadeh, Sridhar Mahadevan

ATAL
2009
Springer

146 views, Intelligent Agents

Online exploration in least-squares policy iteration

15 years 9 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

GECCO
2005
Springer

111 views, Optimization

XCS with eligibility traces

15 years 8 months ago

Download www.bcs.rochester.edu

The development of the XCS Learning Classifier System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investig...

Jan Drugowitsch, Alwyn Barry

ESANN
2003

152 views, Neural Networks

Improving iterative repair strategies for scheduling with the SVM

15 years 4 months ago

Download www2.in.tu-clausthal.de

The resource constraint project scheduling problem (RCPSP) is an NP-hard benchmark problem in scheduling which takes into account the limitation of resources’ availabilities in ...

Kai Gersmann, Barbara Hammer

UAI
2008

236 views, Artificial Intelligence

CORL: A Continuous-state Offset-dynamics Reinforcement Learner

15 years 4 months ago

Download uai2008.cs.helsinki.fi

Continuous state spaces and stochastic, switching dynamics characterize a number of rich, real-world domains, such as robot navigation across varying terrain. We describe a reinfor...

Emma Brunskill, Bethany R. Leffler, Lihong Li, Mic...
