Sciweavers

ACL
2008

Joint Processing and Discriminative Training for Letter-to-Phoneme Conversion

We present a discriminative structure-prediction model for the letter-to-phoneme task, a crucial step in text-to-speech processing. Our method encompasses three tasks that have been previously handled separately: input segmentation, phoneme prediction, and sequence modeling. The key idea is online discriminative training, which updates parameters according to a comparison of the current system output to the desired output, allowing us to train all of our components together. By folding the three steps of a pipeline approach into a unified dynamic programming framework, we are able to achieve substantial performance gains. Our results surpass the current state-of-the-art on six publicly available data sets representing four different languages.
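
The idea in the abstract can be illustrated with a minimal sketch: one dynamic program searches jointly over letter segmentations and phoneme sequences, and an online update compares the system output with the desired output. The abstract does not name the update rule, so a plain structured-perceptron update is used here as a stand-in. The feature set, the toy data, the assumption that gold segmentations are given, and all function names are hypothetical and are not taken from the paper.

```python
from collections import defaultdict

START = "<s>"  # hypothetical start-of-sequence symbol


def features(chunk, phoneme, prev_phoneme):
    """Illustrative features: a letter-chunk/phoneme pair and a phoneme bigram."""
    return [("emit", chunk, phoneme), ("trans", prev_phoneme, phoneme)]


def decode(word, weights, phonemes, max_chunk=2):
    """Jointly choose a segmentation and a phoneme sequence by dynamic programming.

    best[i][p] holds (score, backpointer) for the best analysis of word[:i]
    whose last phoneme is p.
    """
    n = len(word)
    if n == 0:
        return []
    best = [dict() for _ in range(n + 1)]
    best[0][START] = (0.0, None)
    for i in range(1, n + 1):
        for k in range(1, min(max_chunk, i) + 1):
            chunk = word[i - k:i]
            for prev, (score, _) in best[i - k].items():
                for p in phonemes:
                    s = score + sum(weights[f] for f in features(chunk, p, prev))
                    if p not in best[i] or s > best[i][p][0]:
                        best[i][p] = (s, (i - k, prev, chunk))
    # Follow backpointers from the best final state.
    p = max(best[n], key=lambda q: best[n][q][0])
    i, out = n, []
    while i > 0:
        _, (j, prev, chunk) = best[i][p]
        out.append((chunk, p))
        i, p = j, prev
    return out[::-1]


def update(weights, analysis, step):
    """Add `step` to the weight of every feature fired by an analysis."""
    prev = START
    for chunk, p in analysis:
        for f in features(chunk, p, prev):
            weights[f] += step
        prev = p


def train(data, phonemes, max_epochs=1000):
    """Online training: decode, compare with the gold analysis, update on a mistake."""
    weights = defaultdict(float)
    for _ in range(max_epochs):
        mistakes = 0
        for word, gold in data:
            predicted = decode(word, weights, phonemes)
            if predicted != gold:
                mistakes += 1
                update(weights, gold, +1.0)       # reward the desired output
                update(weights, predicted, -1.0)  # penalize the system output
        if mistakes == 0:
            break
    return weights


# Toy data (hypothetical): each word paired with a gold list of (chunk, phoneme).
PHONEMES = ["f", "a", "t", "k"]
DATA = [
    ("phat", [("ph", "f"), ("a", "a"), ("t", "t")]),
    ("cat", [("c", "k"), ("a", "a"), ("t", "t")]),
]

weights = train(DATA, PHONEMES)
print(decode("phat", weights, PHONEMES))
print(decode("cat", weights, PHONEMES))
```

Because segmentation, phoneme choice, and the phoneme-bigram sequence score all feed a single search, one update adjusts the weights for all three at once, which is the sense in which the components are trained together in this sketch.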
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2008
Where ACL
Authors Sittichai Jiampojamarn, Colin Cherry, Grzegorz Kondrak