Applying Many-to-Many Alignments and Hidden Markov Models to Letter-to-Phoneme Conversion



Letter-to-phoneme conversion generally requires aligned training data of letters and phonemes. Typically, the alignments are limited to one-to-one alignments. We present a novel technique of training with many-to-many alignments. A letter chunking bigram prediction manages double letters and double phonemes automatically as opposed to preprocessing with fixed lists. We also apply an HMM method in conjunction with a local classification model to predict a global phoneme sequence given a word. The many-to-many alignments result in significant improvements over the traditional one-to-one approach. Our system achieves state-of-the-art performance on several languages and data sets.
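
To illustrate the difference between one-to-one and many-to-many alignments, the following minimal sketch enumerates every monotone way of pairing letter chunks with phoneme chunks under a size limit. It is not the authors' training procedure, which the abstract does not detail; the function name `enumerate_alignments`, the `max_letters` and `max_phonemes` parameters, and the ARPAbet-style phoneme symbols are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

# A chunk pairs a run of letters with a run of phonemes (hypothetical type).
Chunk = Tuple[str, Tuple[str, ...]]


def enumerate_alignments(
    letters: str,
    phonemes: Sequence[str],
    max_letters: int = 2,
    max_phonemes: int = 2,
) -> List[List[Chunk]]:
    """Return every monotone alignment in which each chunk covers
    1..max_letters letters and 1..max_phonemes phonemes."""
    # Both sequences consumed: exactly one (empty) way to finish.
    if not letters and not phonemes:
        return [[]]
    results: List[List[Chunk]] = []
    # If only one side is exhausted, one of these ranges is empty and
    # the function returns no alignments for this branch.
    for i in range(1, min(max_letters, len(letters)) + 1):
        for j in range(1, min(max_phonemes, len(phonemes)) + 1):
            head = (letters[:i], tuple(phonemes[:j]))
            for rest in enumerate_alignments(
                letters[i:], phonemes[j:], max_letters, max_phonemes
            ):
                results.append([head] + rest)
    return results


# "ship" has four letters but three phonemes (assumed transcription),
# so no one-to-one alignment exists.
one_to_one = enumerate_alignments("ship", ["SH", "IH", "P"], 1, 1)

# Allowing two letters per chunk admits alignments such as sh|i|p.
many_to_one = enumerate_alignments("ship", ["SH", "IH", "P"], 2, 1)
for alignment in many_to_one:
    print(alignment)
```

With a one-letter limit the word cannot be aligned at all without inserting null symbols, which is the situation that many-to-many alignments avoid. A real system would score the candidates (for example with expectation-maximization over chunk pairs) rather than list them exhaustively, since the number of candidates grows quickly with word length.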

Sittichai Jiampojamarn, Grzegorz Kondrak, Tarek Sherif


Aligned Training Data | Computational Linguistics | Many-to-many Alignments | NAACL 2007 | One-to-one Alignments


Post Info

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2007
Where	NAACL
Authors	Sittichai Jiampojamarn, Grzegorz Kondrak, Tarek Sherif
