Search Sciweavers | Sciweavers

226 search results - page 24 / 46

» A Convergent Reinforcement Learning Algorithm in the Continu...

click to vote

GECCO
2009
Springer

124views Optimization» more GECCO 2009»

Reinforcement learning for games: failures and successes

13 years 12 months ago

Download www.gm.fh-koeln.de

We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...

Wolfgang Konen, Thomas Bartz-Beielstein

claim paper

Read More »

click to vote

NIPS
2008

159views Information Technology» more NIPS 2008»

Policy Search for Motor Primitives in Robotics

13 years 8 months ago

Download www.kyb.tuebingen.mpg.de

Many motor skills in humanoid robotics can be learned using parametrized motor primitives as done in imitation learning. However, most interesting motor learning problems are high...

Jens Kober, Jan Peters

claim paper

Read More »

click to vote

ICRA
2006
IEEE

161views Robotics» more ICRA 2006»

Quadruped Robot Obstacle Negotiation via Reinforcement Learning

14 years 1 months ago

Download www.stanford.edu

— Legged robots can, in principle, traverse a large variety of obstacles and terrains. In this paper, we describe a successful application of reinforcement learning to the proble...

Honglak Lee, Yirong Shen, Chih-Han Yu, Gurjeet Sin...

claim paper

Read More »

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Q-Decomposition for Reinforcement Learning Agents

14 years 8 months ago

Download www.hpl.hp.com

The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and...

Stuart J. Russell, Andrew Zimdars

claim paper

Read More »

click to vote

ICML
2009
IEEE

194views Machine Learning» more ICML 2009»

Binary action search for learning continuous-action control policies

14 years 8 months ago

Download www.intelligence.tuc.gr

Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces are quite common in real-wor...

Jason Pazis, Michail G. Lagoudakis

claim paper

Read More »

« Prev « First page 24 / 46 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers