Gesture-based Dynamic Bayesian Network for noise robust speech recognition


We previously proposed several models for estimating articulatory gestures and vocal tract variable (TV) trajectories from synthetic speech, and showed that, when deployed on natural speech, such models can improve the noise robustness of a hidden Markov model (HMM) based speech recognition system. In this paper we propose a model for estimating TVs that is trained on natural speech, and we present a Dynamic Bayesian Network (DBN) based speech recognition architecture that treats vocal tract constriction gestures as hidden variables, eliminating the need for explicit gesture recognition. Using the proposed architecture, we performed a word recognition task on the noisy data of Aurora2. Using gestural information as hidden variables in the DBN architecture gave a significant improvement over using only a mel-frequency cepstral coefficient (MFCC) based HMM or DBN back end. We also compare our results with other noise-robust front ends.
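To make the idea of treating gestures as hidden variables concrete, here is a minimal sketch, not the authors' architecture. It assumes a toy discrete model in which a word-level state `w` evolves as a Markov chain, a gesture `g` is drawn from `P(g | w)` at every frame, and the observation depends only on `g`. All names and probability tables below are hypothetical, chosen only for illustration. The gesture is never decoded; it is summed out inside the forward recursion.

```python
def forward_likelihood(obs, prior_w, trans_w, gest_given_w, emit_given_g):
    """Return P(obs) for a toy model with the hidden gesture summed out.

    obs           : list of discrete observation symbols (ints)
    prior_w[w]    : P(w_1 = w)
    trans_w[v][w] : P(w_t = w | w_{t-1} = v)
    gest_given_w  : gest_given_w[w][g] = P(g_t = g | w_t = w)
    emit_given_g  : emit_given_g[g][o] = P(o_t = o | g_t = g)
    """
    n_w = len(prior_w)
    n_g = len(emit_given_g)

    def frame_score(w, o):
        # Marginalize the hidden gesture: sum_g P(g | w) * P(o | g).
        return sum(gest_given_w[w][g] * emit_given_g[g][o] for g in range(n_g))

    # Standard forward recursion over the word-level state only.
    alpha = [prior_w[w] * frame_score(w, obs[0]) for w in range(n_w)]
    for o in obs[1:]:
        alpha = [
            frame_score(w, o) * sum(alpha[v] * trans_w[v][w] for v in range(n_w))
            for w in range(n_w)
        ]
    return sum(alpha)


# Hypothetical toy tables: 2 word states, 2 gestures, 2 observation symbols.
PRIOR_W = [0.6, 0.4]
TRANS_W = [[0.7, 0.3], [0.2, 0.8]]
GEST_GIVEN_W = [[0.9, 0.1], [0.3, 0.7]]
EMIT_GIVEN_G = [[0.8, 0.2], [0.1, 0.9]]

likelihood = forward_likelihood([0, 1, 1], PRIOR_W, TRANS_W, GEST_GIVEN_W, EMIT_GIVEN_G)
```

In this simplified form the gesture has no dependence on its own past, so the model reduces to an HMM with a gesture-indexed mixture emission. A fuller DBN would also let each gesture depend on the previous frame's gesture, which enlarges the recursion to the joint (word state, gesture) space.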

Vikramjit Mitra, Hosung Nam, Carol Y. Espy-Wilson, Elliot Saltzman, Louis Goldstein

ICASSP 2011 | Natural Speech | Signal Processing | Speech Recognition | Vocal Tract |

» Robust modeling and recognition of hand gestures with dynamic Bayesian network

» Using a DBN to integrate sparse classification and GMMbased ASR

» Modelling the prepausal lengthening effect for speech recognition a dynamic Bayesian netwo...

» Localization of nonlinguistic events in spontaneous speech by NonNegative Matrix Factoriza...

» A MultiModal MixedState Dynamic Bayesian Network for Robust Meeting Event Recognition from...

Post Info

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Vikramjit Mitra, Hosung Nam, Carol Y. Espy-Wilson, Elliot Saltzman, Louis Goldstein
