Sciweavers

LREC
2008

Holy Moses! Leveraging Existing Tools and Resources for Entity Translation

Recently, there has been an emphasis on creating shared resources for natural language processing applications. This has resulted in the development of high-quality tools and data, which can then be leveraged by the research community as components for novel systems. In this paper, we reuse an open source machine translation framework to create an Arabic-to-English entity translation system. The system first translates known entity mentions using a standard phrase-based statistical machine translation framework, which is then reused to perform name transliteration on unknown mentions. In order to transliterate names more accurately, we introduce an algorithm to augment a names database with name origin and frequency information from existing data resources. Origin information is used to learn name origin classifiers and origin-specific transliteration models, while frequency information is used to select amongst n-best transliteration candidates. This work demonstrates the feasibility...
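The abstract says frequency information is used to select among n-best transliteration candidates but does not give the selection rule. As a minimal sketch, assuming each candidate carries a model log-probability and that a log-frequency bonus is simply added to it, the selection step might look like the following. The function name `select_candidate`, the `freq_weight` parameter, the scoring formula, and the sample names and counts are all hypothetical illustrations, not the authors' method or data.

```python
import math


def select_candidate(nbest, name_freq, freq_weight=1.0):
    """Pick one transliteration from an n-best list (hypothetical sketch).

    nbest: list of (candidate, model_log_prob) pairs from a transliteration model.
    name_freq: dict mapping lowercased names to counts from a names database.
    freq_weight: assumed interpolation weight for the frequency bonus.
    """
    def score(item):
        candidate, log_prob = item
        count = name_freq.get(candidate.lower(), 0)
        # Add-one smoothing: a name absent from the database gets a bonus of log(1) = 0.
        return log_prob + freq_weight * math.log(1 + count)

    # Return only the candidate string of the highest-scoring pair.
    return max(nbest, key=score)[0]


# Made-up candidates and counts, for illustration only.
nbest = [("Mohamed", -2.3), ("Muhammad", -2.1), ("Mohammad", -2.5)]
name_freq = {"mohamed": 120, "muhammad": 15, "mohammad": 40}
best = select_candidate(nbest, name_freq)
```

With `freq_weight=0.0` the function falls back to the model's own top candidate, which makes it easy to compare the two behaviours on the same n-best list.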
Jean Tavernier, Rosa Cowan, Michelle Vanni
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2008
Where LREC