Search Sciweavers | Sciweavers

411 search results - page 42 / 83

» Learning to Fly: An Application of Hierarchical Reinforcemen...

CORR
2010
Springer

204 views | Education

Predictive State Temporal Difference Learning

13 years 6 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identification. In practical applications...

Byron Boots, Geoffrey J. Gordon

AAAI
1996

118 views | Intelligent Agents

Monitoring Frog Communities: An Application of Machine Learning

13 years 9 months ago

Download www.aaai.org

Automatic recognition of animal vocalisations would be a valuable tool for a variety of biological research and environmental monitoring applications. We report the development o...

Andrew Taylor, Graeme Watson, Gordon Grigg, Hamish...

ROBOCUP
2000
Springer

130 views | Robotics

Improvement Continuous Valued Q-learning and Its Application to Vision Guided Behavior Acquisition

13 years 11 months ago

Download www.er.ams.eng.osaka-u.ac.jp

Q-learning, a most widely used reinforcement learning method, normally needs well-defined quantized state and action spaces to converge. This makes it difficult to be applied to re...

Yasutake Takahashi, Masanori Takeda, Minoru Asada

JCP
2007

143 views

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

13 years 7 months ago

Download www.academypublisher.com

We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-best-paths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

IROS
2006
IEEE

113 views | Robotics

Policy Gradient Methods for Robotics

14 years 1 months ago

Download www.cs.utah.edu

The acquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...

Jan Peters, Stefan Schaal
