Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

12

ICASSP
2010
IEEE

favoriteEmaildiscussreport

140views Signal Processing» more ICASSP 2010»

Improving speech recognition by explicit modeling of phone deletions

13 years 4 months ago

Improving speech recognition by explicit modeling of phone deletions

Download www.cs.ust.hk

In a paper published by Greenberg in 1998, it was said that in conversational speech, phone deletion rate may go as high as 12% whereas syllable deletion rate is about 1%. The ﬁnding prompted a new research direction of syllable modeling for speech recognition. To date, the syllable approach has not yet fulﬁlled its promise. On the other hand, there were few attempts to model phone deletions explicitly in current ASR systems. In this paper, fragmented word models were derived from well-trained cross-word triphone models, and phone deletion was implemented by skip arcs for words consisting of at least four phonemes. An evaluation on CSR-II WSJ1 Hub2 5K task shows that even with this limited implementation of phone deletions in read speech, we obtained a word error rate reduction of 6.73%.

Tom Ko, Brian Mak

Real-time Traffic

Deletion Rate | ICASSP 2010 | Phone Deletion Rate | Phone Deletions | Signal Processing |

claim paper

Related Content

» Phone mismatch penalty matrices for twostage keyword spotting via multipass phone recogniz...

» Modeling spontaneous speech events during recognition

» Corrected tandem features for acoustic model training

» Large vocabulary continuous speech recognition with contextdependent DBNHMMS

» Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture M...

» Error Approximation and Minimum Phone Error Acoustic Model Estimation

» Discriminative training methods for language models using conditional entropy criteria

» Investigation of acoustic units for LVCSR systems

» Gesturebased Dynamic Bayesian Network for noise robust speech recognition

Post Info
More Details (n/a)

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Tom Ko, Brian Mak

Comments (0)