Sciweavers

163 search results - page 3 / 33
» Policy Gradient Methods for Robotics
ICRA
2008
IEEE
14 years 1 month ago
Compliant manipulation for peg-in-hole: Is passive compliance a key to learn contact motion?
We examine the usefulness of passive compliance in a manipulator that learns contact motion. Based on the observation that humans outperform robots at contact motion, we foll...
Seung-kook Yun
RAS
2010
13 years 1 month ago
Policy gradient learning for quadruped soccer robots
Andrea Cherubini, Francesca Giannone, Luca Iocchi,...
ESANN
2008
13 years 8 months ago
Similarities and differences between policy gradient methods and evolution strategies
Natural policy gradient methods and the covariance matrix adaptation evolution strategy, two variable metric methods proposed for solving reinforcement learning tasks, are contrast...
Verena Heidrich-Meisner, Christian Igel
ESANN
2007
13 years 8 months ago
Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning
In this paper, we investigate motor primitive learning with the Natural Actor-Critic approach. The Natural Actor-Critic consists of actor updates which are achieved using natur...
Jan Peters, Stefan Schaal
UAI
2008
13 years 8 months ago
Improving Gradient Estimation by Incorporating Sensor Data
An efficient policy search algorithm should estimate the local gradient of the objective function, with respect to the policy parameters, from as few trials as possible. Whereas m...
Gregory Lawrence, Stuart J. Russell