Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models

13 years 4 months ago

Download www.fit.vutbr.cz

Although research has previously been done on multilingual speech recognition, it has been found to be very difﬁcult to improve over separately trained systems. The usual approach has been to use some kind of “universal phone set” that covers multiple languages. We report experiments on a different approach to multilingual speech recognition, in which the phone sets are entirely distinct but the model has parameters not tied to speciﬁc states that are shared across languages. We use a model called a “Subspace Gaussian Mixture Model” where states’ distributions are Gaussian Mixture Models with a common structure, constrained to lie in a subspace of the total parameter space. The parameters that deﬁne this subspace can be shared across languages. We obtain substantial WER improvements with this approach, especially with very small amounts of inlanguage training data.

Lukas Burget, Petr Schwarz, Mohit Agarwal, Pinar A

Real-time Traffic

Gaussian Mixture Models | ICASSP 2010 | Multilingual Speech Recognition | Signal Processing | Subspace Gaussian Mixture |

claim paper

» Subspace Gaussian Mixture Models for speech recognition

» An investigation of subspace modeling for phonetic and speaker variability in automatic sp...

» A Hybrid HMMBased Speech Recognizer Using KernelBased Discriminants as Acoustic Models

» Emotion recognition from speech VIA boosted Gaussian mixture models

» The subspace Gaussian mixture model A structured model for speech recognition

» Mixture of Support Vector Machines for HMM based Speech Recognition

» Implicit Trajectory Modeling through Gaussian Transition Models for Speech Recognition

» Factor analysed hidden Markov models for speech recognition

Post Info
More Details (n/a)

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Lukas Burget, Petr Schwarz, Mohit Agarwal, Pinar Akyazi, Kai Feng, Arnab Ghoshal, Ondrej Glembek, Nagendra K. Goel, Martin Karafiát, Daniel Povey, Ariya Rastrow, Richard C. Rose, Samuel Thomas

Comments (0)

Sciweavers

Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models

Gaussian Mixture Models | ICASSP 2010 | Multilingual Speech Recognition | Signal Processing | Subspace Gaussian Mixture |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers