Sciweavers

163 search results - page 2 / 33
» Policy Gradient Methods for Robotics
CIS
2005
Springer
14 years 17 days ago
An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm
Recently, actor-critic methods have drawn much interest in the area of reinforcement learning, and several algorithms have been studied along the line of the actor-critic strategy...
Jooyoung Park, Jongho Kim, Daesung Kang
IROS
2007
IEEE
14 years 1 months ago
An extended policy gradient algorithm for robot task learning
Andrea Cherubini, Francesca Giannone, Luca Iocchi,...
IDEAL
2004
Springer
14 years 11 days ago
Policy Gradient Method for Team Markov Games
The main aim of this paper is to extend the single-agent policy gradient method for multiagent domains where all agents share the same utility function. We formulate these team pro...
Ville Könönen
NIPS
2008
13 years 8 months ago
Policy Search for Motor Primitives in Robotics
Many motor skills in humanoid robotics can be learned using parametrized motor primitives as done in imitation learning. However, most interesting motor learning problems are high...
Jens Kober, Jan Peters
JMLR
2010
13 years 1 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...