Most current machine transliteration systems employ a corpus of known source-target word pairs to train their system, and typically evaluate their systems on a similar corpus. In t...
The task of identifying the language of text or utterances has a number of applications in natural language processing. Language identification has traditionally been approached w...
In this paper we propose an approach to deal with the Chinese-English cross-language image retrieval problem. Text-based image retrieval and query translation methods were...
We propose several techniques for improving statistical machine translation between closely-related languages with scarce resources. We use character-level translation trained on ...
This paper presents a joint optimization method of a two-step conditional random field (CRF) model for machine transliteration and a fast decoding algorithm for the proposed metho...