Corrected tandem features for acoustic model training

13 years 9 months ago
Corrected tandem features for acoustic model training
This paper describes a simple method for significantly improving Tandem features used to train acoustic models for large-vocabulary speech recognition. The linear activations at the outputs of an MLP classifier were modified according to known reference labels: where necessary, the activation of the output unit corresponding to the correct phone label was increased in order to make an accurate classification. This technique was inspired by another experiment that determined a lower error bound on ASR performance within the Tandem framework. By simulating an idealized classifier with forward-backward phone posterior probabilities, we observed a best-case scenario in which nearly all errors were eliminated. Although this performance is not practically achievable, the experiment demonstrated the validity of the Tandem processing approach and suggested that considerable gains are possible by improving the MLP phone classifier.
Arlo Faria, Nelson Morgan
Added 30 May 2010
Updated 30 May 2010
Type Conference
Year 2008
Authors Arlo Faria, Nelson Morgan
Comments (0)