Sciweavers

NAACL
2007

Applying Many-to-Many Alignments and Hidden Markov Models to Letter-to-Phoneme Conversion

13 years 6 months ago
Applying Many-to-Many Alignments and Hidden Markov Models to Letter-to-Phoneme Conversion
Letter-to-phoneme conversion generally requires aligned training data of letters and phonemes. Typically, the alignments are limited to one-to-one alignments. We present a novel technique of training with many-to-many alignments. A letter chunking bigram prediction manages double letters and double phonemes automatically as opposed to preprocessing with fixed lists. We also apply an HMM method in conjunction with a local classification model to predict a global phoneme sequence given a word. The many-to-many alignments result in significant improvements over the traditional one-to-one approach. Our system achieves state-of-the-art performance on several languages and data sets.
Sittichai Jiampojamarn, Grzegorz Kondrak, Tarek Sh
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2007
Where NAACL
Authors Sittichai Jiampojamarn, Grzegorz Kondrak, Tarek Sherif
Comments (0)