Sciweavers

878 search results - page 36 / 176
» Learning to Control in Operational Space
Sort
View
118
Voted
IROS
2007
IEEE
147views Robotics» more  IROS 2007»
15 years 10 months ago
Autonomous learning of 3D reaching in a humanoid robot
— In this paper, we describe the implementation of a precise reaching controller on an upper-torso humanoid robot. The solution we propose does not rely on prior models of the ki...
Francesco Nori, Lorenzo Natale, Giulio Sandini, Gi...
133
Voted
CDC
2009
IEEE
132views Control Systems» more  CDC 2009»
15 years 8 months ago
Q-learning and Pontryagin's Minimum Principle
Abstract— Q-learning is a technique used to compute an optimal policy for a controlled Markov chain based on observations of the system controlled using a non-optimal policy. It ...
Prashant G. Mehta, Sean P. Meyn
129
Voted
IJCAI
1989
15 years 4 months ago
Integrating Knowledge-Based System and Neural Network Techniques for Robotic Skill Acquisition
This paper describes an approach to robotic control that is patterned after models of human skill acquisition. The intent is to develop robots capable of learning how to accomplis...
David Handelman, Stephen Lane, Jack Gelfand
125
Voted
ICRA
2005
IEEE
140views Robotics» more  ICRA 2005»
15 years 9 months ago
Fast Reinforcement Learning for Vision-guided Mobile Robots
— This paper presents a new reinforcement learning algorithm for accelerating acquisition of new skills by real mobile robots, without requiring simulation. It speeds up Q-learni...
Tomás Martínez-Marín, Tom Duc...
163
Voted
ICML
2006
IEEE
15 years 9 months ago
Automatic basis function construction for approximate dynamic programming and reinforcement learning
We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...
Philipp W. Keller, Shie Mannor, Doina Precup