Sciweavers

ACL
2006

Named Entity Transliteration with Comparable Corpora

13 years 5 months ago
Named Entity Transliteration with Comparable Corpora
In this paper we investigate ChineseEnglish name transliteration using comparable corpora, corpora where texts in the two languages deal in some of the same topics -- and therefore share references to named entities -- but are not translations of each other. We present two distinct methods for transliteration, one approach using phonetic transliteration, and the second using the temporal distribution of candidate pairs. Each of these approaches works quite well, but by combining the approaches one can achieve even better results. We then propose a novel score propagation method that utilizes the co-occurrence of transliteration pairs within document pairs. This propagation method achieves further improvement over the best results from the previous step.
Richard Sproat, Tao Tao, ChengXiang Zhai
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2006
Where ACL
Authors Richard Sproat, Tao Tao, ChengXiang Zhai
Comments (0)