Search Sciweavers | Sciweavers

1233 search results - page 36 / 247

» Reinforcement Learning in MirrorBot

107

click to vote

SIAMCO
2000

117views more SIAMCO 2000»

The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning

15 years 2 months ago

Download eprints.iisc.ernet.in

It is shown here that stability of the stochastic approximation algorithm is implied by the asymptotic stability of the origin for an associated ODE. This in turn implies convergen...

Vivek S. Borkar, Sean P. Meyn

claim paper

Read More »

148

click to vote

ICASSP
2011
IEEE

153views Signal Processing» more ICASSP 2011»

Reinforcement learning for energy-efficient wireless transmission

14 years 6 months ago

Download mirlab.org

We consider the problem of energy-efficient point-to-point transmission of delay-sensitive data (e.g. multimedia data) over a fading channel. We propose a rigorous and unified fra...

Nicholas Mastronarde, Mihaela van der Schaar

claim paper

Read More »

138

click to vote

CAEPIA
2011
Springer

188views Artificial Intelligence» more CAEPIA 2011»

Evaluating a Reinforcement Learning Algorithm with a General Intelligence Test

14 years 2 months ago

Download users.dsic.upv.es

In this paper we apply the recent notion of anytime universal intelligence tests to the evaluation of a popular reinforcement learning algorithm, Q-learning. We show that a general...

Javier Insa-Cabrera, David L. Dowe, José He...

claim paper

Read More »

140

click to vote

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 4 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

123

click to vote

AAAI
2000

139views Intelligent Agents» more AAAI 2000»

Localizing Search in Reinforcement Learning

15 years 4 months ago

Download www.cs.colorado.edu

Reinforcement learning (RL) can be impractical for many high dimensional problems because of the computational cost of doing stochastic search in large state spaces. We propose a ...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

« Prev « First page 36 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers