Search Sciweavers | Sciweavers

43 search results - page 7 / 9

» The O.D.E. Method for Convergence of Stochastic Approximatio...

click to vote

ICML
2009
IEEE

194views Machine Learning» more ICML 2009»

Binary action search for learning continuous-action control policies

14 years 9 months ago

Download www.intelligence.tuc.gr

Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces are quite common in real-wor...

Jason Pazis, Michail G. Lagoudakis

claim paper

Read More »

click to vote

ECML
2004
Springer

139views Machine Learning» more ECML 2004»

Batch Reinforcement Learning with State Importance

14 years 1 months ago

Download www.research.rutgers.edu

Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classiﬁer mapping states to actions....

Lihong Li, Vadim Bulitko, Russell Greiner

claim paper

Read More »

click to vote

ECML
2005
Springer

193views Machine Learning» more ECML 2005»

Natural Actor-Critic

14 years 2 months ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal

claim paper

Read More »

click to vote

ICML
1998
IEEE

179views Machine Learning» more ICML 1998»

Value Function Based Production Scheduling

14 years 9 months ago

Download www.ri.cmu.edu

Production scheduling, the problem of sequentially con guring a factory to meet forecasted demands, is a critical problem throughout the manufacturing industry. The requirement of...

Jeff G. Schneider, Justin A. Boyan, Andrew W. Moor...

claim paper

Read More »

click to vote

SAB
2010
Springer

226views Optimization» more SAB 2010»

Distributed Online Learning of Central Pattern Generators in Modular Robots

13 years 7 months ago

Download modular.mmmi.sdu.dk

Abstract. In this paper we study distributed online learning of locomotion gaits for modular robots. The learning is based on a stochastic approximation method, SPSA, which optimiz...

David Johan Christensen, Alexander Spröwitz, ...

claim paper

Read More »

« Prev « First page 7 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers