Sciweavers

CORR
2010
Springer
13 years 6 months ago
Predictive State Temporal Difference Learning
We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identification. In practical applications...
Byron Boots, Geoffrey J. Gordon
AAAI
1996
13 years 9 months ago
Monitoring Frog Communities: An Application of Machine Learning
Automatic recognition of animal vocalisations would be a valuable tool for a variety of biological research and environmental monitoring applications. We report the development o...
Andrew Taylor, Graeme Watson, Gordon Grigg, Hamish...
ROBOCUP
2000
Springer
13 years 11 months ago
Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition
Q-learning, one of the most widely used reinforcement learning methods, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to apply to re...
Yasutake Takahashi, Masanori Takeda, Minoru Asada
JCP
2007
13 years 7 months ago
Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization
We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-best-paths algorithm. We consider an a...
Nicolas Chapados, Yoshua Bengio
IROS
2006
IEEE
14 years 1 months ago
Policy Gradient Methods for Robotics
The acquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...
Jan Peters, Stefan Schaal