Sciweavers

1262 search results - page 107 / 253
» Reinforcement Learning: An Introduction
Sort
View
NIPS
2001
13 years 9 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
AAAI
1998
13 years 9 months ago
Applying Online Search Techniques to Continuous-State Reinforcement Learning
In this paper, we describe methods for e ciently computing better solutions to control problems in continuous state spaces. We provide algorithms that exploit online search to boo...
Scott Davies, Andrew Y. Ng, Andrew W. Moore
ICANN
2010
Springer
13 years 9 months ago
Exploring Continuous Action Spaces with Diffusion Trees for Reinforcement Learning
We propose a new approach for reinforcement learning in problems with continuous actions. Actions are sampled by means of a diffusion tree, which generates samples in the continuou...
Christian Vollmer, Erik Schaffernicht, Horst-Micha...
GECON
2008
Springer
134views Business» more  GECON 2008»
13 years 9 months ago
Rational Bidding Using Reinforcement Learning
The application of autonomous agents by the provisioning and usage of computational resources is an attractive research field. Various methods and technologies in the area of arti...
Nikolay Borissov, Arun Anandasivam, Niklas Wirstr&...
NN
2006
Springer
127views Neural Networks» more  NN 2006»
13 years 8 months ago
The asymptotic equipartition property in reinforcement learning and its relation to return maximization
We discuss an important property called the asymptotic equipartition property on empirical sequences in reinforcement learning. This states that the typical set of empirical seque...
Kazunori Iwata, Kazushi Ikeda, Hideaki Sakai