Search Sciweavers | Sciweavers

536 search results - page 10 / 108

» Residual Algorithms: Reinforcement Learning with Function Ap...

click to vote

AAAI
2011

202views Intelligent Agents» more AAAI 2011»

Value Function Approximation in Reinforcement Learning Using the Fourier Basis

12 years 7 months ago

Download people.csail.mit.edu

We describe the Fourier Basis, a linear value function approximation scheme based on the Fourier Series. We empirically evaluate its properties, and demonstrate that it performs w...

George Konidaris, Sarah Osentoski, Philip Thomas

claim paper

Read More »

click to vote

ICML
2006
IEEE

256views Machine Learning» more ICML 2006»

Automatic basis function construction for approximate dynamic programming and reinforcement learning

14 years 1 months ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

claim paper

Read More »

click to vote

AAAI
2010

171views Intelligent Agents» more AAAI 2010»

Reinforcement Learning via AIXI Approximation

13 years 9 months ago

Download jveness.info

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. This approach is based on a direct approximation of AIXI, a Bayesian...

Joel Veness, Kee Siong Ng, Marcus Hutter, David Si...

claim paper

Read More »

click to vote

ICML
2000
IEEE

155views Machine Learning» more ICML 2000»

Combining Reinforcement Learning with a Local Control Algorithm

14 years 8 months ago

Download www-anw.cs.umass.edu

We explore combining reinforcement learning with a hand-crafted local controller in a manner suggested by the chaotic control algorithm of Vincent, Schmitt and Vincent (1994). A c...

Andrew G. Barto, Jette Randløv, Michael T. ...

claim paper

Read More »

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

13 years 9 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

« Prev « First page 10 / 108 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers