Training Recurrent Networks by Evolino

In recent years, gradient-based LSTM recurrent neural networks (RNNs) solved many previously RNN-unlearnable tasks. Sometimes, however, gradient information is of little use for training RNNs, due to numerous local minima. For such cases we present a novel method, namely, EVOlution of systems with LINear Outputs (Evolino). Evolino evolves weights to the nonlinear, hidden nodes of RNNs while computing optimal linear mappings from hidden state to output, using methods such as pseudo-inverse-based linear regression. If we instead use quadratic programming to maximize the margin, we obtain the first evolutionary recurrent Support Vector Machines. We show that Evolino-based LSTM can solve tasks that Echo State nets [15] cannot, and achieves higher accuracy in certain continuous function generation tasks than conventional gradient descent RNNs, including gradient-based LSTM.
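The division of labour described in the abstract (evolve the hidden weights, solve for the output weights in closed form) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a small tanh RNN in place of LSTM cells, a plain truncation-selection evolutionary loop with Gaussian mutation in place of whatever evolutionary algorithm the paper uses, and a toy next-step sine prediction task. All function names, parameters, and hyperparameter values below are hypothetical.

```python
import numpy as np

def run_hidden(w_in, w_rec, inputs):
    """Run a tanh RNN over a 1-D input sequence; return the (T, n_hidden) states."""
    h = np.zeros(w_rec.shape[0])
    states = []
    for u in inputs:
        h = np.tanh(w_in * u + w_rec @ h)
        states.append(h)
    return np.array(states)

def fit_readout(states, targets):
    """Least-squares linear output weights via the Moore-Penrose pseudo-inverse."""
    phi = np.hstack([states, np.ones((len(states), 1))])  # append a bias column
    w_out = np.linalg.pinv(phi) @ targets
    mse = float(np.mean((phi @ w_out - targets) ** 2))
    return w_out, mse

def evolino_sketch(inputs, targets, n_hidden=5, pop_size=20, generations=30, seed=0):
    """Evolve hidden weights only; the readout is recomputed for every candidate."""
    rng = np.random.default_rng(seed)
    n_genes = n_hidden + n_hidden * n_hidden  # input weights + recurrent weights

    def evaluate(genome):
        w_in = genome[:n_hidden]
        w_rec = genome[n_hidden:].reshape(n_hidden, n_hidden)
        return fit_readout(run_hidden(w_in, w_rec, inputs), targets)[1]

    population = rng.normal(0.0, 0.5, size=(pop_size, n_genes))
    history = []  # best error per generation
    for _ in range(generations):
        errors = np.array([evaluate(g) for g in population])
        order = np.argsort(errors)
        history.append(float(errors[order[0]]))
        # Keep the best quarter unchanged (elitism), refill with mutated copies.
        parents = population[order[: pop_size // 4]]
        picks = rng.integers(len(parents), size=pop_size - len(parents))
        children = parents[picks] + rng.normal(0.0, 0.1, size=(len(picks), n_genes))
        population = np.vstack([parents, children])

    errors = np.array([evaluate(g) for g in population])
    best = population[int(np.argmin(errors))]
    return best, float(errors.min()), history

if __name__ == "__main__":
    # Toy task (an assumption, not from the paper): predict the next sine sample.
    series = np.sin(0.2 * np.arange(200))
    _, err, _ = evolino_sketch(series[:-1], series[1:])
    print("final training MSE:", err)
```

The point of the sketch is that the fitness of each candidate is the residual error left after the best possible linear readout has been fitted, so evolution only has to search the nonlinear hidden weights. Replacing `fit_readout` with a margin-maximizing solver would correspond to the Support Vector Machine variant mentioned in the abstract.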

Jürgen Schmidhuber, Daan Wierstra, Matteo Gagliolo, Faustino J. Gomez

Gradient-based LSTM | NECO 2007 | Numerous Local Minima | Optimal Linear Mappings

» A System for Robotic Heart Surgery that Learns to Tie Knots Using Recurrent Neural Network...

» Modeling systems with internal state using evolino

» Fast training of recurrent networks based on the EM algorithm

» A neurodynamical model for working memory

» An Accelerating Learning Algorithm for Block-Diagonal Recurrent Neural Networks

» Identification of finite state automata with a class of recurrent neural networks

» An EM Based Training Algorithm for Recurrent Neural Networks

» Recurrent neural network based language model

Post Info

Added	27 Dec 2010
Updated	27 Dec 2010
Type	Journal
Year	2007
Where	NECO
Authors	Jürgen Schmidhuber, Daan Wierstra, Matteo Gagliolo, Faustino J. Gomez
