Sciweavers

163 search results - page 9 / 33
» Policy Gradient Methods for Robotics
ICRA
2002
IEEE
14 years 26 days ago
Coverage Control for Mobile Sensing Networks
This paper describes decentralized control laws for the coordination of multiple vehicles performing spatially distributed tasks. The control laws are based on a gradient desce...
Jorge Cortés, Sonia Martínez, Timur ...
NIPS
2007
13 years 9 months ago
Incremental Natural Actor-Critic Algorithms
We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...
Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...
ICRA
2000
IEEE
14 years 9 days ago
A New Redundancy-Based Iterative Scheme for Avoiding Joint Limits: Application to Visual Servoing
We propose in this paper new redundancy-based solutions to avoid robot joint limits of a manipulator. We use a control scheme based on the task function approach. We first recall...
François Chaumette, Éric Marchand
IROS
2008
IEEE
14 years 2 months ago
Learning nonparametric policies by imitation
A long-cherished goal in artificial intelligence has been the ability to endow a robot with the capacity to learn and generalize skills from watching a human teacher. Such an ...
David B. Grimes, Rajesh P. N. Rao
NN
2010
Springer
13 years 2 months ago
Efficient exploration through active learning for value function approximation in reinforcement learning
Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares ...
Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiya...