Sciweavers

163 search results - page 12 / 33
Policy Gradient Methods for Robotics
NIPS
2003
13 years 9 months ago
Distributed Optimization in Adaptive Networks
We develop a protocol for optimizing dynamic behavior of a network of simple electronic components, such as a sensor network, an ad hoc network of mobile devices, or a network of ...
Ciamac Cyrus Moallemi, Benjamin Van Roy
IJCAI
2001
13 years 9 months ago
Robot Weightlifting By Direct Policy Search
This paper describes a method for structuring a robot motor learning task. By designing a suitably parameterized policy, we show that a simple search algorithm, along with biologi...
Michael T. Rosenstein, Andrew G. Barto
CDC
2010
IEEE
13 years 2 months ago
Pathologies of temporal difference methods in approximate dynamic programming
Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...
Dimitri P. Bertsekas
TSMC
1998
13 years 7 months ago
Decentralized adaptive fuzzy control of robot manipulators
This paper develops a decentralized adaptive fuzzy control scheme for robot manipulators via a combination of genetic algorithm and gradient method. The controller for each link...
Yaochu Jin
CCIA
2005
Springer
14 years 1 months ago
Direct Policy Search Reinforcement Learning for Robot Control
This paper proposes a high-level Reinforcement Learning (RL) control system for solving the action selection problem of an autonomous robot. Although the dominant approach, whe...
Andres El-Fakdi, Marc Carreras, Narcís Palo...