Search Sciweavers | Sciweavers

1262 search results - page 107 / 253

» Reinforcement Learning: An Introduction

NIPS
2001

144 views, Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 5 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

AAAI
1998

181 views, Intelligent Agents

Applying Online Search Techniques to Continuous-State Reinforcement Learning

15 years 5 months ago

Download www.autonlab.org

In this paper, we describe methods for efficiently computing better solutions to control problems in continuous state spaces. We provide algorithms that exploit online search to boo...

Scott Davies, Andrew Y. Ng, Andrew W. Moore

ICANN
2010
Springer

166 views, Neural Networks

Exploring Continuous Action Spaces with Diffusion Trees for Reinforcement Learning

15 years 4 months ago

Download www.tu-ilmenau.de

We propose a new approach for reinforcement learning in problems with continuous actions. Actions are sampled by means of a diffusion tree, which generates samples in the continuou...

Christian Vollmer, Erik Schaffernicht, Horst-Micha...

GECON
2008
Springer

134 views, Business

Rational Bidding Using Reinforcement Learning

15 years 4 months ago

Download eprints.sics.se

The application of autonomous agents by the provisioning and usage of computational resources is an attractive research field. Various methods and technologies in the area of arti...

Nikolay Borissov, Arun Anandasivam, Niklas Wirstr&...

NN
2006
Springer

127 views, Neural Networks

The asymptotic equipartition property in reinforcement learning and its relation to return maximization

15 years 3 months ago

Download www.ece.uvic.ca

We discuss an important property called the asymptotic equipartition property on empirical sequences in reinforcement learning. This states that the typical set of empirical seque...

Kazunori Iwata, Kazushi Ikeda, Hideaki Sakai
