Sciweavers

163 search results - page 16 / 33
» Policy Gradient Methods for Robotics
Sort
View
TSMC
1998
82views more  TSMC 1998»
13 years 7 months ago
To push or not to push: on the rearrangement of movable objects by a mobile robot
—We formulate and address the problem of planning a pushing manipulation by a mobile robot which tries to rearrange several movable objects in its work space. We present an algor...
Ohad Ben-Shahar, Ehud Rivlin
IROS
2008
IEEE
146views Robotics» more  IROS 2008»
14 years 2 months ago
Redundancy resolution of a 7 DOF haptic interface considering collision and singularity avoidance
— This paper presents a new redundancy resolution approach based on the gradient projection method, which allows to avoid man-machine and machine-machine collisions as well as si...
Yuta Komoguchi, Ken'ichi Yano, Angelika Peer, Mart...
IROS
2007
IEEE
168views Robotics» more  IROS 2007»
14 years 2 months ago
Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression
Abstract— We propose to improve the locomotive performance of humanoid robots by using approximated biped stepping and walking dynamics with reinforcement learning (RL). Although...
Jun Morimoto, Christopher G. Atkeson, Gen Endo, Go...
ICML
2001
IEEE
14 years 8 months ago
Off-Policy Temporal Difference Learning with Function Approximation
We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta
ICRA
2007
IEEE
132views Robotics» more  ICRA 2007»
14 years 2 months ago
Smooth Feedback for Car-Like Vehicles in Polygonal Environments
— We introduce a method for constructing provably safe smooth feedback laws for car-like robots in obstaclecluttered polygonal environments. The robot is taken to be a point with...
Stephen R. Lindemann, Steven M. LaValle