Sciweavers

175 search results - page 29 / 35
» Forgetting Reinforced Cases
ICML
2009
IEEE
14 years 8 months ago
Near-Bayesian exploration in polynomial time
We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...
J. Zico Kolter, Andrew Y. Ng
ICML
2009
IEEE
14 years 8 months ago
Regularization and feature selection in least-squares temporal difference learning
We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...
J. Zico Kolter, Andrew Y. Ng
IROS
2008
IEEE
14 years 1 months ago
Learning perceptual coupling for motor primitives
Dynamic system-based motor primitives [1] have enabled robots to learn complex tasks ranging from tennis swings to locomotion. However, to date there have been only few extension...
Jens Kober, Betty J. Mohler, Jan Peters
IROS
2008
IEEE
14 years 1 months ago
Dynamic correlation matrix based multi-Q learning for a multi-robot system
Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...
Hongliang Guo, Yan Meng
IROS
2007
IEEE
14 years 1 months ago
Motor control optimization of compliant one-legged locomotion in rough terrain
While underactuated robotic systems are capable of energy efficient and rapid dynamic behavior, we still do not fully understand how body dynamics can be actively used for ada...
Fumiya Iida, Russ Tedrake