Search Sciweavers | Sciweavers

NIPS
2008

Information Technology

On the asymptotic equivalence between differential Hebbian and temporal difference learning using a local third factor

15 years 8 months ago

In this theoretical contribution we provide mathematical proof that two of the most important classes of network learning - correlation-based differential Hebbian learning and rew...

Christoph Kolodziejski, Bernd Porr, Minija Tamosiu...

ICML
2010
IEEE

Machine Learning

Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda

15 years 4 months ago

Download www.icml2010.org

Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...

Carlton Downey, Scott Sanner

COLING
2010

Computational Linguistics

Comparison of different algebras for inducing the temporal structure of texts

15 years 1 month ago

Download aclweb.org

This paper investigates the impact of using different temporal algebras for learning temporal relations between events. Specifically, we compare three interval-based algebras: Alle...

Pascal Denis, Philippe Muller

CDC
2010
IEEE

Control Systems

Pathologies of temporal difference methods in approximate dynamic programming

15 years 1 month ago

Download web.mit.edu

Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...

Dimitri P. Bertsekas

ICML
2008
IEEE

Machine Learning

A worst-case comparison between temporal difference and residual gradient with linear function approximation

16 years 7 months ago

Download www.research.rutgers.edu

Residual gradient (RG) was proposed as an alternative to TD(0) for policy evaluation when function approximation is used, but there exists little formal analysis comparing them ex...

Lihong Li
