A comparison between spiking and differentiable recurrent neural networks on spoken digit recognition

13 years 6 months ago

Download www.informatik.uni-ulm.de

In this paper we demonstrate that Long Short-Term Memory (LSTM) is a differentiable recurrent neural net (RNN) capable of robustly categorizing timewarped speech data. We measure its performance on a spoken digit identification task, where the data was spike-encoded in such a way that classifying the utterances became a difficult challenge in non-linear timewarping. We find that LSTM gives greatly superior results to an SNN found in the literature, and conclude that the architecture has a place in domains that require the learning of large timewarped datasets, such as automatic speech recognition. KEY WORDS Speech Recognition, LSTM, RNN, SNN, Timewarping

Alex Graves, Nicole Beringer, Jürgen Schmidhu

Real-time Traffic

Long Short-Term Memory | NCI 2004 | Neural Networks | Speech Recognition | Timewarped Speech Data |

claim paper

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2004
Where	NCI
Authors	Alex Graves, Nicole Beringer, Jürgen Schmidhuber

Comments (0)

Sciweavers

A comparison between spiking and differentiable recurrent neural networks on spoken digit recognition

Long Short-Term Memory | NCI 2004 | Neural Networks | Speech Recognition | Timewarped Speech Data |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers