Search Sciweavers | Sciweavers

411 search results - page 25 / 83

SIAMCO
2000

117 views

The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning

13 years 7 months ago

Download eprints.iisc.ernet.in

It is shown here that stability of the stochastic approximation algorithm is implied by the asymptotic stability of the origin for an associated ODE. This in turn implies convergen...

Vivek S. Borkar, Sean P. Meyn

CAEPIA
2011
Springer

188 views, Artificial Intelligence

Evaluating a Reinforcement Learning Algorithm with a General Intelligence Test

12 years 7 months ago

Download users.dsic.upv.es

In this paper we apply the recent notion of anytime universal intelligence tests to the evaluation of a popular reinforcement learning algorithm, Q-learning. We show that a general...

Javier Insa-Cabrera, David L. Dowe, José He...

ICML
2004
IEEE

158 views, Machine Learning

Adaptive cognitive orthotics: combining reinforcement learning and constraint-based temporal reasoning

14 years 8 months ago

Download www.eecs.umich.edu

Reminder systems support people with impaired prospective memory and/or executive function, by providing them with reminders of their functional daily activities. We integrate tem...

Matthew R. Rudary, Satinder P. Singh, Martha E. Po...

AI
2006
Springer

119 views, Artificial Intelligence

Partial Local FriendQ Multiagent Learning: Application to Team Automobile Coordination Problem

13 years 11 months ago

Download damas.ift.ulaval.ca

Real world multiagent coordination problems are important issues for reinforcement learning techniques. In general, these problems are partially observable and this characteristic ...

Julien Laumonier, Brahim Chaib-draa

IJCNN
2008
IEEE

113 views, Neural Networks

Uncertainty propagation for quality assurance in Reinforcement Learning

14 years 2 months ago

Download www.inb.uni-luebeck.de

In this paper we address the reliability of policies derived by Reinforcement Learning on a limited amount of observations. This can be done in a principled manner by taking in...

Daniel Schneegaß, Steffen Udluft, Thomas Mar...
