Sciweavers

NCI
2004

A comparison between spiking and differentiable recurrent neural networks on spoken digit recognition

14 years 1 months ago
A comparison between spiking and differentiable recurrent neural networks on spoken digit recognition
In this paper we demonstrate that Long Short-Term Memory (LSTM) is a differentiable recurrent neural net (RNN) capable of robustly categorizing timewarped speech data. We measure its performance on a spoken digit identification task, where the data was spike-encoded in such a way that classifying the utterances became a difficult challenge in non-linear timewarping. We find that LSTM gives greatly superior results to an SNN found in the literature, and conclude that the architecture has a place in domains that require the learning of large timewarped datasets, such as automatic speech recognition. KEY WORDS Speech Recognition, LSTM, RNN, SNN, Timewarping
Alex Graves, Nicole Beringer, Jürgen Schmidhu
Added 31 Oct 2010
Updated 31 Oct 2010
Type Conference
Year 2004
Where NCI
Authors Alex Graves, Nicole Beringer, Jürgen Schmidhuber
Comments (0)