Search Sciweavers | Sciweavers

SIAMCO
2000

The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning

15 years 5 months ago

It is shown here that stability of the stochastic approximation algorithm is implied by the asymptotic stability of the origin for an associated ODE. This in turn implies convergen...

Vivek S. Borkar, Sean P. Meyn

ICASSP
2011
IEEE

Signal Processing

Reinforcement learning for energy-efficient wireless transmission

14 years 9 months ago

Download mirlab.org

We consider the problem of energy-efficient point-to-point transmission of delay-sensitive data (e.g. multimedia data) over a fading channel. We propose a rigorous and unified fra...

Nicholas Mastronarde, Mihaela van der Schaar

CAEPIA
2011
Springer

Artificial Intelligence

Evaluating a Reinforcement Learning Algorithm with a General Intelligence Test

14 years 6 months ago

Download users.dsic.upv.es

In this paper we apply the recent notion of anytime universal intelligence tests to the evaluation of a popular reinforcement learning algorithm, Q-learning. We show that a general...

Javier Insa-Cabrera, David L. Dowe, José He...

NN
2007
Springer

Neural Networks

Guiding exploration by pre-existing knowledge without modifying reward

15 years 5 months ago

Download www.cs.hut.fi

Reinforcement learning is based on exploration of the environment and receiving reward that indicates which actions taken by the agent are good and which ones are bad. In many app...

Kary Främling

JSAC
2010

Online learning in autonomic multi-hop wireless networks for transmitting mission-critical applications

15 years 4 months ago

Download medianetlab.ee.ucla.edu

In this paper, we study how to optimize the transmission decisions of nodes aimed at supporting mission-critical applications, such as surveillance, security monitoring,...

Hsien-Po Shiang, Mihaela van der Schaar
