Search Sciweavers | Sciweavers

1236 search results - page 10 / 248

» Opposition-Based Reinforcement Learning

139

click to vote

SIAMCO
2000

117views more SIAMCO 2000»

The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning

15 years 5 months ago

Download eprints.iisc.ernet.in

It is shown here that stability of the stochastic approximation algorithm is implied by the asymptotic stability of the origin for an associated ODE. This in turn implies convergen...

Vivek S. Borkar, Sean P. Meyn

claim paper

Read More »

195

click to vote

ICASSP
2011
IEEE

153views Signal Processing» more ICASSP 2011»

Reinforcement learning for energy-efficient wireless transmission

14 years 9 months ago

Download mirlab.org

We consider the problem of energy-efficient point-to-point transmission of delay-sensitive data (e.g. multimedia data) over a fading channel. We propose a rigorous and unified fra...

Nicholas Mastronarde, Mihaela van der Schaar

claim paper

Read More »

179

click to vote

CAEPIA
2011
Springer

188views Artificial Intelligence» more CAEPIA 2011»

Evaluating a Reinforcement Learning Algorithm with a General Intelligence Test

14 years 6 months ago

Download users.dsic.upv.es

In this paper we apply the recent notion of anytime universal intelligence tests to the evaluation of a popular reinforcement learning algorithm, Q-learning. We show that a general...

Javier Insa-Cabrera, David L. Dowe, José He...

claim paper

Read More »

196

click to vote

GECCO
2009
Springer

162views Optimization» more GECCO 2009»

Uncertainty handling CMA-ES for reinforcement learning

15 years 3 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMAES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

174

click to vote

WSDM
2012
ACM

214views Data Mining» more WSDM 2012»

Selecting actions for resource-bounded information extraction using reinforcement learning

14 years 1 months ago

Download people.cs.umass.edu

Given a database with missing or uncertain content, our goal is to correct and ﬁll the database by extracting speciﬁc information from a large corpus such as the Web, and to d...

Pallika H. Kanani, Andrew K. McCallum

claim paper

Read More »

« Prev « First page 10 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers