On the asymptotic equivalence between differential Hebbian and temporal difference learning using a local third factor

14 years 2 months ago

Download books.nips.cc

In this theoretical contribution we provide mathematical proof that two of the most important classes of network learning - correlation-based differential Hebbian learning and reward-based temporal difference learning - are asymptotically equivalent when timing the learning with a local modulatory signal. This opens the opportunity to consistently reformulate most of the abstract reinforcement learning framework from a correlation based perspective that is more closely related to the biophysics of neurons.

Christoph Kolodziejski, Bernd Porr, Minija Tamosiu

Real-time Traffic

Information Technology | Local Modulatory Signal | NIPS 2008 | Reinforcement Learning Framework | Temporal Difference Learning |

claim paper

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	NIPS
Authors	Christoph Kolodziejski, Bernd Porr, Minija Tamosiunaite, Florentin Wörgötter

Comments (0)

Sciweavers

On the asymptotic equivalence between differential Hebbian and temporal difference learning using a local third factor

Information Technology | Local Modulatory Signal | NIPS 2008 | Reinforcement Learning Framework | Temporal Difference Learning |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers