Sciweavers

340 search results - page 10 / 68
» Kernelized value function approximation for reinforcement le...
Sort
View
NIPS
1996
15 years 3 months ago
Multidimensional Triangulation and Interpolation for Reinforcement Learning
Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...
Scott Davies
ICPR
2006
IEEE
16 years 3 months ago
Control Double Inverted Pendulum by Reinforcement Learning with Double CMAC Network
To accelerate the learning of reinforcement learning, many types of function approximation are used to represent state value. However function approximation reduces the accuracy o...
Siwei Luo, Yu Zheng, Ziang Lv
115
Voted
NIPS
2001
15 years 3 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
90
Voted
ICML
2008
IEEE
16 years 3 months ago
An analysis of reinforcement learning with function approximation
Francisco S. Melo, Sean P. Meyn, M. Isabel Ribeiro
112
Voted
ICML
2008
IEEE
16 years 3 months ago
Reinforcement learning in the presence of rare events
We consider the task of reinforcement learning in an environment in which rare significant events occur independently of the actions selected by the controlling agent. If these ev...
Jordan Frank, Shie Mannor, Doina Precup