Sciweavers

1340 search results - page 12 / 268
» Kalman Temporal Differences
Sort
View
ICML
1999
IEEE
14 years 8 months ago
Least-Squares Temporal Difference Learning
Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...
Justin A. Boyan
ISDA
2009
IEEE
14 years 2 months ago
Postponed Updates for Temporal-Difference Reinforcement Learning
This paper presents postponed updates, a new strategy for TD methods that can improve sample efficiency without incurring the computational and space requirements of model-based ...
Harm van Seijen, Shimon Whiteson
ECAI
2008
Springer
13 years 9 months ago
Using Decision Trees as the Answer Networks in Temporal Difference-Networks
Laura-Andreea Antanas, Kurt Driessens, Jan Ramon, ...