Asymmetrically Boosted HMM for Speech Reading

14 years 6 months ago

Download www.cc.gatech.edu

Speech reading, also known as lip reading, is aimed at extracting visual cues of lip and facial movements to aid in recognition of speech. The main hurdle for speech reading is that visual measurements of lip and facial motion lack information-rich features like the Mel frequency cepstral coefficients (MFCC), widely used in acoustic speech recognition. These MFCC are used with hidden Markov models (HMM) in most speech recognition systems at present. Speech reading could greatly benefit from automatic selection and formation of informative features from measurements in the visual domain. These new features can then be used with HMM to capture the dynamics of lip movement and eventual recognition of lip shapes. Towards this end, we use AdaBoost methods for automatic visual feature formation. Specifically, we design an asymmetric variant of AdaBoost M2 algorithm to deal with the ill-posed multi-class sample distribution inherent in our problem. Our experiments show that the boosted HMM a...

Pei Yin, Irfan A. Essa, James M. Rehg

Real-time Traffic

Acoustic Speech Recognition | Computer Vision | CVPR 2004 | HMM Classifiers | Lip Reading | Speech Reading | Speech Recognition Systems |

claim paper

Added	12 Oct 2009
Updated	29 Oct 2009
Type	Conference
Year	2004
Where	CVPR
Authors	Pei Yin, Irfan A. Essa, James M. Rehg

Sciweavers

Asymmetrically Boosted HMM for Speech Reading

Acoustic Speech Recognition | Computer Vision | CVPR 2004 | HMM Classifiers | Lip Reading | Speech Reading | Speech Recognition Systems |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers