Search Sciweavers | Sciweavers

26

ICML
2006
IEEE

136views Machine Learning» more ICML 2006»

An analytic solution to discrete Bayesian reinforcement learning

14 years 10 months ago

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

claim paper

Read More »

33

click to vote

GECCO
2009
Springer

124views Optimization» more GECCO 2009»

Reinforcement learning for games: failures and successes

14 years 2 months ago

Download www.gm.fh-koeln.de

We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...

Wolfgang Konen, Thomas Bartz-Beielstein

claim paper

Read More »

23

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

14 years 2 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

26

click to vote

PKDD
2009
Springer

129views Data Mining» more PKDD 2009»

Considering Unseen States as Impossible in Factored Reinforcement Learning

14 years 4 months ago

Download www-desir.lip6.fr

Abstract. The Factored Markov Decision Process (FMDP) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a ...

Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillem...

claim paper

Read More »

29

click to vote

IAT
2007
IEEE

92views Intelligent Agents» more IAT 2007»

Noise Tolerance in Reinforcement Learning Algorithms

14 years 4 months ago

Download www.ppgia.pucpr.br

This paper proposes a mechanism of noise tolerance for reinforcement learning algorithms. An adaptive agent that employs reinforcement learning algorithms may receive and accumula...

Richardson Ribeiro, Alessandro L. Koerich, Fabr&ia...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers