Search Sciweavers | Sciweavers

473 search results - page 6 / 95

» Optimal policy switching algorithms for reinforcement learni...

click to vote

ECML
2003
Springer

149views Machine Learning» more ECML 2003»

Could Active Perception Aid Navigation of Partially Observable Grid Worlds?

14 years 27 days ago

Download homepages.inf.ed.ac.uk

Due to the unavoidable fact that a robot’s sensors will be limited in some manner, it is entirely possible that it can ﬁnd itself unable to distinguish between diﬀering state...

Paul A. Crook, Gillian Hayes

claim paper

Read More »

click to vote

ICML
2007
IEEE

141views Machine Learning» more ICML 2007»

Reinforcement learning by reward-weighted regression for operational space control

14 years 8 months ago

Download www.machinelearning.org

Many robot control problems of practical importance, including operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of ...

Jan Peters, Stefan Schaal

claim paper

Read More »

click to vote

ICML
2006
IEEE

156views Machine Learning» more ICML 2006»

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

14 years 8 months ago

Download animatlab.lip6.fr

Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (fmdps). However, these algorithms need ...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

claim paper

Read More »

click to vote

ICRA
2010
IEEE

145views Robotics» more ICRA 2010»

Reinforcement learning of motor skills in high dimensions: A path integral approach

13 years 6 months ago

Download www-personal.acfr.usyd.edu.au

— Reinforcement learning (RL) is one of the most general approaches to learning control. Its applicability to complex motor systems, however, has been largely impossible so far d...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

claim paper

Read More »

click to vote

ML
2002
ACM

121views Machine Learning» more ML 2002»

Near-Optimal Reinforcement Learning in Polynomial Time

13 years 7 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

« Prev « First page 6 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers