Sciweavers

ACL
2012

Deciphering Foreign Language by Combining Language Models and Context Vectors

11 years 6 months ago
Deciphering Foreign Language by Combining Language Models and Context Vectors
In this paper we show how to train statistical machine translation systems on reallife tasks using only non-parallel monolingual data from two languages. We present a modification of the method shown in (Ravi and Knight, 2011) that is scalable to vocabulary sizes of several thousand words. On the task shown in (Ravi and Knight, 2011) we obtain better results with only 5% of the computational effort when running our method with an n-gram language model. The efficiency improvement of our method allows us to run experiments with vocabulary sizes of around 5,000 words, such as a non-parallel version of the VERBMOBIL corpus. We also report results using data from the monolingual French and English GIGAWORD corpora.
Malte Nuhn, Arne Mauser, Hermann Ney
Added 29 Sep 2012
Updated 29 Sep 2012
Type Journal
Year 2012
Where ACL
Authors Malte Nuhn, Arne Mauser, Hermann Ney
Comments (0)