Search Sciweavers | Sciweavers

175 search results - page 21 / 35

» Forgetting Reinforced Cases

158

ML
2008
ACM

152views Machine Learning» more ML 2008»

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

15 years 3 months ago

Download hal.inria.fr

Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, R&ea...

claim paper

Read More »

115

click to vote

GECCO
2005
Springer

111views Optimization» more GECCO 2005»

XCS with eligibility traces

15 years 8 months ago

Download www.bcs.rochester.edu

The development of the XCS Learning Classiﬁer System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investig...

Jan Drugowitsch, Alwyn Barry

claim paper

Read More »

129

click to vote

NIPS
2008

159views Information Technology» more NIPS 2008»

Policy Search for Motor Primitives in Robotics

15 years 4 months ago

Download www.kyb.tuebingen.mpg.de

Many motor skills in humanoid robotics can be learned using parametrized motor primitives as done in imitation learning. However, most interesting motor learning problems are high...

Jens Kober, Jan Peters

claim paper

Read More »

132

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 4 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

120

click to vote

ESANN
2003

152views Neural Networks» more ESANN 2003»

Improving iterative repair strategies for scheduling with the SVM

15 years 4 months ago

Download www2.in.tu-clausthal.de

The resource constraint project scheduling problem (RCPSP) is an NP-hard benchmark problem in scheduling which takes into account the limitation of resources’ availabilities in ...

Kai Gersmann, Barbara Hammer

claim paper

Read More »

« Prev « First page 21 / 35 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers