Sciweavers

ACL
2012

A Statistical Model for Unsupervised and Semi-supervised Transliteration Mining

11 years 6 months ago
A Statistical Model for Unsupervised and Semi-supervised Transliteration Mining
We propose a novel model to automatically extract transliteration pairs from parallel corpora. Our model is efficient, language pair independent and mines transliteration pairs in a consistent fashion in both unsupervised and semi-supervised settings. We model transliteration mining as an interpolation of transliteration and non-transliteration sub-models. We evaluate on NEWS 2010 shared task data and on parallel corpora with competitive results.
Hassan Sajjad, Alexander Fraser, Helmut Schmid
Added 29 Sep 2012
Updated 29 Sep 2012
Type Journal
Year 2012
Where ACL
Authors Hassan Sajjad, Alexander Fraser, Helmut Schmid
Comments (0)