Search Sciweavers | Sciweavers

378 search results - page 20 / 76

» Reinforcement Learning for Online Control of Evolutionary Al...

IJCNN
2006
IEEE

Reinforcement Learning for Parameterized Motor Primitives

15 years 9 months ago

Download www-clmc.usc.edu

One of the major challenges in both action generation for robotics and in the understanding of human motor control is to learn the “building blocks of movement genera...

Jan Peters, Stefan Schaal

SIAMCO
2000

The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning

15 years 3 months ago

Download eprints.iisc.ernet.in

It is shown here that stability of the stochastic approximation algorithm is implied by the asymptotic stability of the origin for an associated ODE. This in turn implies convergen...

Vivek S. Borkar, Sean P. Meyn

JMLR
2002

Lyapunov Design for Safe Reinforcement Learning

15 years 2 months ago

Download www-anw.cs.umass.edu

Lyapunov design methods are used widely in control engineering to design controllers that achieve qualitative objectives, such as stabilizing a system or maintaining a system'...

Theodore J. Perkins, Andrew G. Barto

ESANN
2006

Construction of a memory management system in an on-line learning mechanism

15 years 4 months ago

Download www.dice.ucl.ac.be

This paper is the first of a two-paper series that deals with an important problem in on-line learning mechanisms for autonomous agents that must perform nontrivial tasks and oper...

Francisco Bellas, José Antonio Becerra, Ric...

ICML
2005
IEEE

Reinforcement learning with Gaussian processes

16 years 4 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir
