Sciweavers

1236 search results - page 10 / 248
» Opposition-Based Reinforcement Learning
Sort
View
SIAMCO
2000
117views more  SIAMCO 2000»
13 years 9 months ago
The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning
It is shown here that stability of the stochastic approximation algorithm is implied by the asymptotic stability of the origin for an associated ODE. This in turn implies convergen...
Vivek S. Borkar, Sean P. Meyn
ICASSP
2011
IEEE
13 years 1 months ago
Reinforcement learning for energy-efficient wireless transmission
We consider the problem of energy-efficient point-to-point transmission of delay-sensitive data (e.g. multimedia data) over a fading channel. We propose a rigorous and unified fra...
Nicholas Mastronarde, Mihaela van der Schaar
CAEPIA
2011
Springer
12 years 9 months ago
Evaluating a Reinforcement Learning Algorithm with a General Intelligence Test
In this paper we apply the recent notion of anytime universal intelligence tests to the evaluation of a popular reinforcement learning algorithm, Q-learning. We show that a general...
Javier Insa-Cabrera, David L. Dowe, José He...
GECCO
2009
Springer
162views Optimization» more  GECCO 2009»
13 years 7 months ago
Uncertainty handling CMA-ES for reinforcement learning
The covariance matrix adaptation evolution strategy (CMAES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...
Verena Heidrich-Meisner, Christian Igel
WSDM
2012
ACM
214views Data Mining» more  WSDM 2012»
12 years 5 months ago
Selecting actions for resource-bounded information extraction using reinforcement learning
Given a database with missing or uncertain content, our goal is to correct and fill the database by extracting specific information from a large corpus such as the Web, and to d...
Pallika H. Kanani, Andrew K. McCallum