Sciweavers

12 search results - page 2 / 3
» Exploiting Parallel Treebanks to Improve Phrase-Based Statis...
Sort
View
EMNLP
2009
13 years 2 months ago
Improved Statistical Machine Translation for Resource-Poor Languages Using Related Resource-Rich Languages
We propose a novel language-independent approach for improving statistical machine translation for resource-poor languages by exploiting their similarity to resource-rich ones. Mo...
Preslav Nakov, Hwee Tou Ng
NLE
2007
180views more  NLE 2007»
13 years 4 months ago
Segmentation and alignment of parallel text for statistical machine translation
We address the problem of extracting bilingual chunk pairs from parallel text to create training sets for statistical machine translation. We formulate the problem in terms of a s...
Yonggang Deng, Shankar Kumar, William Byrne
COLING
2010
12 years 11 months ago
Urdu and Hindi: Translation and sharing of linguistic resources
Hindi and Urdu share a common phonology, morphology and grammar but are written in different scripts. In addition, the vocabularies have also diverged significantly especially in ...
Karthik Visweswariah, Vijil Chenthamarakshan, Nand...
EMNLP
2008
13 years 6 months ago
Improved Sentence Alignment on Parallel Web Pages Using a Stochastic Tree Alignment Model
Parallel web pages are important source of training data for statistical machine translation. In this paper, we present a new approach to sentence alignment on parallel web pages....
Lei Shi, Ming Zhou
EMNLP
2008
13 years 6 months ago
Language and Translation Model Adaptation using Comparable Corpora
Traditionally, statistical machine translation systems have relied on parallel bi-lingual data to train a translation model. While bi-lingual parallel data are expensive to genera...
Matthew G. Snover, Bonnie J. Dorr, Richard M. Schw...