Search Sciweavers | Sciweavers

23

GECCO
2009
Springer

124views Optimization» more GECCO 2009»

Reinforcement learning for games: failures and successes

14 years 11 days ago

We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...

Wolfgang Konen, Thomas Bartz-Beielstein

claim paper

Read More »

21

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

14 years 3 days ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

21

click to vote

PKDD
2009
Springer

129views Data Mining» more PKDD 2009»

Considering Unseen States as Impossible in Factored Reinforcement Learning

14 years 2 months ago

Download www-desir.lip6.fr

Abstract. The Factored Markov Decision Process (FMDP) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a ...

Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillem...

claim paper

Read More »

19

click to vote

IAT
2007
IEEE

92views Intelligent Agents» more IAT 2007»

Noise Tolerance in Reinforcement Learning Algorithms

14 years 2 months ago

Download www.ppgia.pucpr.br

This paper proposes a mechanism of noise tolerance for reinforcement learning algorithms. An adaptive agent that employs reinforcement learning algorithms may receive and accumula...

Richardson Ribeiro, Alessandro L. Koerich, Fabr&ia...

claim paper

Read More »

21

click to vote

ROBOCUP
2007
Springer

102views Robotics» more ROBOCUP 2007»

Heuristic Reinforcement Learning Applied to RoboCup Simulation Agents

14 years 1 months ago

Download www.fei.edu.br

This paper describes the design and implementation of robotic agents for the RoboCup Simulation 2D category that learns using a recently proposed Heuristic Reinforcement Learning a...

Luiz A. Celiberto, Carlos H. C. Ribeiro, Anna Hele...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers