Sciweavers

6 search results - page 2 / 2
» Postponed Updates for Temporal-Difference Reinforcement Lear...
Sort
View
ECAI
2006
Springer
13 years 11 months ago
Least Squares SVM for Least Squares TD Learning
Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...
Tobias Jung, Daniel Polani