Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity

The persistent modification of synaptic efficacy as a function of the relative timing of pre- and postsynaptic spikes is a phenomenon known as spike-timing-dependent plasticity (STDP). Here we show that the modulation of STDP by a global reward signal leads to reinforcement learning. We first derive analytically learning rules involving reward-modulated spike-timing-dependent synaptic and intrinsic plasticity, by applying a reinforcement learning algorithm to the stochastic Spike Response Model of spiking neurons. These rules have several features common to plasticity mechanisms experimentally found in the brain. We then demonstrate in simulations of networks of integrate-and-fire neurons the efficacy of two simple learning rules involving modulated STDP. One rule is a direct extension of the standard STDP model (modulated STDP), while the other one involves an eligibility trace stored at each synapse that keeps a decaying memory of the relationships between the recent pairs of pre...
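
The two kinds of rule named in the abstract can be illustrated with a small discrete-time sketch for a single synapse. This is only an illustration under stated assumptions, not the paper's derivation or its exact equations: the function name `run_modulated_stdp`, the exponential pairing traces, and all parameter values (`tau_plus`, `tau_minus`, `tau_z`, `a_plus`, `a_minus`, `gamma`) are hypothetical choices made for the example. Spikes are given as 0/1 values per time step and the reward as one scalar per step.

```python
import math


def run_modulated_stdp(pre_spikes, post_spikes, rewards, dt=1.0,
                       tau_plus=20.0, tau_minus=20.0, tau_z=25.0,
                       a_plus=1.0, a_minus=-1.0, gamma=0.1,
                       use_eligibility_trace=True, w0=0.0):
    """Return the final weight of one synapse (illustrative sketch only).

    All names and default values are assumptions made for this example.
    """
    decay_plus = math.exp(-dt / tau_plus)
    decay_minus = math.exp(-dt / tau_minus)
    decay_z = math.exp(-dt / tau_z)

    p_plus = 0.0   # decaying memory of recent presynaptic spikes
    p_minus = 0.0  # decaying memory of recent postsynaptic spikes
    z = 0.0        # eligibility trace stored at the synapse
    w = w0

    for pre, post, r in zip(pre_spikes, post_spikes, rewards):
        p_plus = p_plus * decay_plus + a_plus * pre
        p_minus = p_minus * decay_minus + a_minus * post

        # Standard STDP term: positive for pre-before-post pairs,
        # negative for post-before-pre pairs.
        zeta = p_plus * post + p_minus * pre

        if use_eligibility_trace:
            # The trace keeps a decaying memory of recent pairings;
            # the reward acts on the trace, so it may arrive late.
            z = z * decay_z + zeta / tau_z
            w += gamma * r * z
        else:
            # Direct modulation: the reward scales the STDP term
            # of the current time step only.
            w += gamma * r * zeta

    return w
```

In this sketch, a pre-before-post pairing followed several steps later by a positive reward changes the weight only when `use_eligibility_trace=True`; with direct modulation the reward has to coincide with the pairing. Flipping the sign of the reward flips the sign of the weight change in both variants.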

Razvan V. Florian

NECO 2007 | Postsynaptic Spike | Reward Signal | STDP |

Added	27 Dec 2010
Updated	27 Dec 2010
Type	Journal
Year	2007
Where	NECO
Authors	Razvan V. Florian
