Sciweavers

15 search results - page 2 / 3
» Incremental Least-Squares Temporal Difference Learning
Sort
View
122
Voted
ICDM
2009
IEEE
120views Data Mining» more  ICDM 2009»
15 years 10 months ago
Least Square Incremental Linear Discriminant Analysis
Abstract—Linear discriminant analysis (LDA) is a wellknown dimension reduction approach, which projects highdimensional data into a low-dimensional space with the best separation...
Li-Ping Liu, Yuan Jiang, Zhi-Hua Zhou
NIPS
2001
15 years 5 months ago
Model-Free Least-Squares Policy Iteration
We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...
Michail G. Lagoudakis, Ronald Parr
PKDD
2009
Springer
169views Data Mining» more  PKDD 2009»
15 years 10 months ago
Hybrid Least-Squares Algorithms for Approximate Policy Evaluation
The goal of approximate policy evaluation is to “best” represent a target value function according to a specific criterion. Temporal difference methods and Bellman residual m...
Jeffrey Johns, Marek Petrik, Sridhar Mahadevan
NECO
2011
14 years 10 months ago
Least Squares Estimation Without Priors or Supervision
Selection of an optimal estimator typically relies on either supervised training samples (pairs of measurements and their associated true values), or a prior probability model for...
Martin Raphan, Eero P. Simoncelli
ICML
2009
IEEE
16 years 4 months ago
Regularization and feature selection in least-squares temporal difference learning
We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...
J. Zico Kolter, Andrew Y. Ng