Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

27

ICASSP
2008
IEEE

favoriteEmaildiscussreport

125views Signal Processing» more ICASSP 2008»

Corrected tandem features for acoustic model training

14 years 2 months ago

Corrected tandem features for acoustic model training

Download www.icsi.berkeley.edu

This paper describes a simple method for signiﬁcantly improving Tandem features used to train acoustic models for large-vocabulary speech recognition. The linear activations at the outputs of an MLP classiﬁer were modiﬁed according to known reference labels: where necessary, the activation of the output unit corresponding to the correct phone label was increased in order to make an accurate classiﬁcation. This technique was inspired by another experiment that determined a lower error bound on ASR performance within the Tandem framework. By simulating an idealized classiﬁer with forward-backward phone posterior probabilities, we observed a best-case scenario in which nearly all errors were eliminated. Although this performance is not practically achievable, the experiment demonstrated the validity of the Tandem processing approach and suggested that considerable gains are possible by improving the MLP phone classiﬁer.

Arlo Faria, Nelson Morgan

Real-time Traffic

Correct Phone Label | ICASSP 2008 | Large-vocabulary Speech Recognition | Signal Processing | Tandem |

claim paper

Related Content

» Unsupervised acoustic and language model training with small amounts of labelled data

» Combining five acoustic level modeling methods for automatic speaker age and gender recogn...

» SCARF a segmental conditional random field toolkit for speech recognition

» Framebased acoustic feature integration for speech understanding

» Acoustic Chord Transcription and Key Extraction From Audio Using KeyDependent HMMs Trained...

» Improving acoustic event detection using generalizable visual features and multimodality m...

» Acoustic and Facial Features for Speaker Recognition

» Strategies for modeling reverberant speech in the feature domain

» Robust speaker identification using an auditorybased feature

Post Info
More Details (n/a)

Added	30 May 2010
Updated	30 May 2010
Type	Conference
Year	2008
Where	ICASSP
Authors	Arlo Faria, Nelson Morgan

Comments (0)