Sciweavers

15 search results - page 2 / 3
» Incremental Least-Squares Temporal Difference Learning
Sort
View
ICDM
2009
IEEE
120views Data Mining» more  ICDM 2009»
14 years 2 months ago
Least Square Incremental Linear Discriminant Analysis
Abstract—Linear discriminant analysis (LDA) is a wellknown dimension reduction approach, which projects highdimensional data into a low-dimensional space with the best separation...
Li-Ping Liu, Yuan Jiang, Zhi-Hua Zhou
NIPS
2001
13 years 8 months ago
Model-Free Least-Squares Policy Iteration
We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...
Michail G. Lagoudakis, Ronald Parr
PKDD
2009
Springer
169views Data Mining» more  PKDD 2009»
14 years 2 months ago
Hybrid Least-Squares Algorithms for Approximate Policy Evaluation
The goal of approximate policy evaluation is to “best” represent a target value function according to a specific criterion. Temporal difference methods and Bellman residual m...
Jeffrey Johns, Marek Petrik, Sridhar Mahadevan
NECO
2011
13 years 2 months ago
Least Squares Estimation Without Priors or Supervision
Selection of an optimal estimator typically relies on either supervised training samples (pairs of measurements and their associated true values), or a prior probability model for...
Martin Raphan, Eero P. Simoncelli
ICML
2009
IEEE
14 years 8 months ago
Regularization and feature selection in least-squares temporal difference learning
We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...
J. Zico Kolter, Andrew Y. Ng