Sciweavers

4544 search results - page 120 / 909
» Reinforcement Learning with Time
Sort
View
AAAI
2006
13 years 10 months ago
On the Difficulty of Modular Reinforcement Learning for Real-World Partial Programming
In recent years there has been a great deal of interest in "modular reinforcement learning" (MRL). Typically, problems are decomposed into concurrent subgoals, allowing ...
Sooraj Bhat, Charles Lee Isbell Jr., Michael Matea...
NIPS
2001
13 years 10 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
AAAI
1998
13 years 10 months ago
Applying Online Search Techniques to Continuous-State Reinforcement Learning
In this paper, we describe methods for e ciently computing better solutions to control problems in continuous state spaces. We provide algorithms that exploit online search to boo...
Scott Davies, Andrew Y. Ng, Andrew W. Moore
ICANN
2010
Springer
13 years 10 months ago
Exploring Continuous Action Spaces with Diffusion Trees for Reinforcement Learning
We propose a new approach for reinforcement learning in problems with continuous actions. Actions are sampled by means of a diffusion tree, which generates samples in the continuou...
Christian Vollmer, Erik Schaffernicht, Horst-Micha...
GECON
2008
Springer
134views Business» more  GECON 2008»
13 years 10 months ago
Rational Bidding Using Reinforcement Learning
The application of autonomous agents by the provisioning and usage of computational resources is an attractive research field. Various methods and technologies in the area of arti...
Nikolay Borissov, Arun Anandasivam, Niklas Wirstr&...