Search Sciweavers | Sciweavers

223 search results - page 2 / 45

» Least-Squares Temporal Difference Learning

200

Voted

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

15 years 8 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

181

click to vote

ICML
2009
IEEE

186views Machine Learning» more ICML 2009»

Regularization and feature selection in least-squares temporal difference learning

16 years 7 months ago

Download ai.stanford.edu

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

189

click to vote

PKDD
2009
Springer

169views Data Mining» more PKDD 2009»

Hybrid Least-Squares Algorithms for Approximate Policy Evaluation

16 years 1 months ago

Download www.cs.umass.edu

The goal of approximate policy evaluation is to “best” represent a target value function according to a speciﬁc criterion. Temporal difference methods and Bellman residual m...

Jeffrey Johns, Marek Petrik, Sridhar Mahadevan

claim paper

Read More »

188

Voted

SIBGRAPI
2009
IEEE

232views Computer Graphics» more SIBGRAPI 2009»

Learning Discriminative Appearance-Based Models Using Partial Least Squares

16 years 1 months ago

Download www.umiacs.umd.edu

Appearance information is essential for applications such as tracking and people recognition. One of the main problems of using appearance-based discriminative models is the ambig...

William Robson Schwartz, Larry S. Davis

claim paper

Read More »

169

click to vote

NIPS
1998

131views Information Technology» more NIPS 1998»

Lazy Learning Meets the Recursive Least Squares Algorithm

15 years 8 months ago

Download www.swarm-bots.org

Lazy learning is a memory-based technique that, once a query is received, extracts a prediction interpolating locally the neighboring examples of the query which are considered re...

Mauro Birattari, Gianluca Bontempi, Hugues Bersini

claim paper

Read More »

« Prev « First page 2 / 45 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers