Sciweavers

Search results for "Relational temporal difference learning": 1118 results, page 8 of 224
AAAI
2006
13 years 10 months ago
Incremental Least-Squares Temporal Difference Learning
Alborz Geramifard, Michael H. Bowling, Richard S. ...
NECO
2010
13 years 7 months ago
Hyperbolically Discounted Temporal Difference Learning
William H. Alexander, Joshua W. Brown
ICML
2008
IEEE
14 years 9 months ago
A worst-case comparison between temporal difference and residual gradient with linear function approximation
Residual gradient (RG) was proposed as an alternative to TD(0) for policy evaluation when function approximation is used, but there exists little formal analysis comparing them ex...
Lihong Li