Investigation of full-sequence training of deep belief networks for speech recognition

12 years 11 months ago

Download research.microsoft.com

Recently, Deep Belief Networks (DBNs) have been proposed for phone recognition and were found to achieve highly competitive performance. In the original DBNs, only framelevel information was used for training DBN weights while it has been known for long that sequential or full-sequence information can be helpful in improving speech recognition accuracy. In this paper we investigate approaches to optimizing the DBN weights, state-to-state transition parameters, and language model scores using the sequential discriminative training criterion. We describe and analyze the proposed training algorithm and strategy, and discuss practical issues and how they affect the final results. We show that the DBNs learned using the sequence-based training criterion outperform those with frame-based criterion using both threelayer and six-layer models, but the optimization procedure for the deeper DBN is more difficult for the former criterion.

Abdel-rahman Mohamed, Dong Yu, L. Deng

Real-time Traffic

Discriminative Training Criterion | INTERSPEECH 2010 | Sequence-based Training Criterion | Signal Processing | Training Criterion |

claim paper

Added	18 May 2011
Updated	18 May 2011
Type	Journal
Year	2010
Where	INTERSPEECH
Authors	Abdel-rahman Mohamed, Dong Yu, L. Deng

Sciweavers

Investigation of full-sequence training of deep belief networks for speech recognition

Discriminative Training Criterion | INTERSPEECH 2010 | Sequence-based Training Criterion | Signal Processing | Training Criterion |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers