Sciweavers

1235 search results - page 42 / 247
» Reinforcement learning in a nutshell
Sort
View
COLT
2000
Springer
15 years 7 months ago
Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning
We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (  ¢¡¤£¦¥§  ), and focus on gradient ascent approache...
Peter L. Bartlett, Jonathan Baxter
PKDD
2009
Springer
129views Data Mining» more  PKDD 2009»
15 years 10 months ago
Considering Unseen States as Impossible in Factored Reinforcement Learning
Abstract. The Factored Markov Decision Process (FMDP) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a ...
Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillem...
132
Voted
IAT
2007
IEEE
15 years 9 months ago
Noise Tolerance in Reinforcement Learning Algorithms
This paper proposes a mechanism of noise tolerance for reinforcement learning algorithms. An adaptive agent that employs reinforcement learning algorithms may receive and accumula...
Richardson Ribeiro, Alessandro L. Koerich, Fabr&ia...
121
Voted
ROBOCUP
2007
Springer
102views Robotics» more  ROBOCUP 2007»
15 years 9 months ago
Heuristic Reinforcement Learning Applied to RoboCup Simulation Agents
This paper describes the design and implementation of robotic agents for the RoboCup Simulation 2D category that learns using a recently proposed Heuristic Reinforcement Learning a...
Luiz A. Celiberto, Carlos H. C. Ribeiro, Anna Hele...
SBIA
2004
Springer
15 years 8 months ago
Heuristically Accelerated Q-Learning: A New Approach to Speed Up Reinforcement Learning
This work presents a new algorithm, called Heuristically Accelerated Q–Learning (HAQL), that allows the use of heuristics to speed up the well-known Reinforcement Learning algori...
Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...