Search Sciweavers | Sciweavers

163 search results - page 3 / 33

» Policy Gradient Methods for Robotics

105

click to vote

ICRA
2008
IEEE

129views Robotics» more ICRA 2008»

Compliant manipulation for peg-in-hole: Is passive compliance a key to learn contact motion?

15 years 8 months ago

Download groups.csail.mit.edu

— We examine the usefulness of passive compliance in a manipulator that learns contact motion. Based on the notice that humans outperforms robots with the contact motion, we foll...

Seung-kook Yun

claim paper

Read More »

107

click to vote

RAS
2010

220views more RAS 2010»

Policy gradient learning for quadruped soccer robots

14 years 9 months ago

Download www.irisa.fr

Andrea Cherubini, Francesca Giannone, Luca Iocchi,...

claim paper

Read More »

109

click to vote

ESANN
2008

115views Neural Networks» more ESANN 2008»

15 years 3 months ago

Similarities and differences between policy gradient methods and evolution strategies

Download www.dice.ucl.ac.be

Natural policy gradient methods and the covariance matrix adaptation evolution strategy, two variable metric methods proposed for solving reinforcement learning tasks, are contrast...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

117

click to vote

ESANN
2007

148views Neural Networks» more ESANN 2007»

Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning

15 years 3 months ago

Download www.dice.ucl.ac.be

In this paper, we investigate motor primitive learning with the Natural Actor-Critic approach. The Natural Actor-Critic consists out of actor updates which are achieved using natur...

Jan Peters, Stefan Schaal

claim paper

Read More »

139

click to vote

UAI
2008

234views Artificial Intelligence» more UAI 2008»

Improving Gradient Estimation by Incorporating Sensor Data

15 years 3 months ago

Download www.cs.berkeley.edu

An efficient policy search algorithm should estimate the local gradient of the objective function, with respect to the policy parameters, from as few trials as possible. Whereas m...

Gregory Lawrence, Stuart J. Russell

claim paper

Read More »

« Prev « First page 3 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers