Search Sciweavers | Sciweavers

1235 search results - page 42 / 247

» Reinforcement learning in a nutshell

107

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 7 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

123

click to vote

PKDD
2009
Springer

129views Data Mining» more PKDD 2009»

Considering Unseen States as Impossible in Factored Reinforcement Learning

15 years 10 months ago

Download www-desir.lip6.fr

Abstract. The Factored Markov Decision Process (FMDP) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a ...

Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillem...

claim paper

Read More »

132

Voted

IAT
2007
IEEE

92views Intelligent Agents» more IAT 2007»

Noise Tolerance in Reinforcement Learning Algorithms

15 years 9 months ago

Download www.ppgia.pucpr.br

This paper proposes a mechanism of noise tolerance for reinforcement learning algorithms. An adaptive agent that employs reinforcement learning algorithms may receive and accumula...

Richardson Ribeiro, Alessandro L. Koerich, Fabr&ia...

claim paper

Read More »

121

Voted

ROBOCUP
2007
Springer

102views Robotics» more ROBOCUP 2007»

Heuristic Reinforcement Learning Applied to RoboCup Simulation Agents

15 years 9 months ago

Download www.fei.edu.br

This paper describes the design and implementation of robotic agents for the RoboCup Simulation 2D category that learns using a recently proposed Heuristic Reinforcement Learning a...

Luiz A. Celiberto, Carlos H. C. Ribeiro, Anna Hele...

claim paper

Read More »

103

click to vote

SBIA
2004
Springer

137views Artificial Intelligence» more SBIA 2004»

Heuristically Accelerated Q-Learning: A New Approach to Speed Up Reinforcement Learning

15 years 8 months ago

Download www.fei.edu.br

This work presents a new algorithm, called Heuristically Accelerated Q–Learning (HAQL), that allows the use of heuristics to speed up the well-known Reinforcement Learning algori...

Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...

claim paper

Read More »

« Prev « First page 42 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers