Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

106

COLT
2000
Springer

favoriteEmaildiscussreport

121views Machine Learning» more COLT 2000»

Bias-Variance Error Bounds for Temporal Difference Updates

15 years 6 months ago

Bias-Variance Error Bounds for Temporal Difference Updates

Download www.cis.upenn.edu

We give the ﬁrst rigorous upper bounds on the error of temporal difference (td) algorithms for policy evaluation as a function of the amount of experience. These upper bounds prove exponentially fast convergence, with both the rate of convergence and the asymptote strongly dependent on the length of the backups k or the parameter . Our bounds give formal veriﬁcation to the long-standing intuition that td methods are subject to a “bias-variance” trade-off, and they lead to schedules for k and that are predicted to be better than any ﬁxed values for these parameters. We give preliminary experimental conﬁrmation of our theory for a version of the random walk problem.

Michael J. Kearns, Satinder P. Singh

Real-time Traffic

COLT 2000 | Machine Learning | Rigorous Upper Bounds | Temporal Difference | Upper Bounds |

claim paper

Related Content

» On Error Bounds of Finite Difference Approximations to Partial Differential Equations Tem...

» Efficient temporal counting with bounded error

» A setmembership approach to consensus problems with bounded measurement errors

» Multilayered impostors for accelerated rendering

» LTL Path Checking Is Efficiently Parallelizable

» Asymmetric Real Adaboost

» Improving Data Quality Consistency and Accuracy

» Selfcorrelating predictive information tracking for largescale production systems

» Processing DataStream Join Aggregates Using Skimmed Sketches

Post Info
More Details (n/a)

Added	02 Aug 2010
Updated	02 Aug 2010
Type	Conference
Year	2000
Where	COLT
Authors	Michael J. Kearns, Satinder P. Singh

Comments (0)