Sciweavers

119 search results - page 2 / 24
» The DAD Parallel Corpora and their Uses
Sort
View
EMNLP
2011
12 years 5 months ago
Inducing Sentence Structure from Parallel Corpora for Reordering
When translating among languages that differ substantially in word order, machine translation (MT) systems benefit from syntactic preordering—an approach that uses features fro...
John DeNero, Jakob Uszkoreit
COLING
2010
13 years 25 days ago
Extraction of Multi-word Expressions from Small Parallel Corpora
We present a general methodology for extracting multi-word expressions (of various types), along with their translations, from small parallel corpora. We automatically align the p...
Yulia Tsvetkov, Shuly Wintner
EMNLP
2011
12 years 5 months ago
Learning Sentential Paraphrases from Bilingual Parallel Corpora for Text-to-Text Generation
Previous work has shown that high quality phrasal paraphrases can be extracted from bilingual parallel corpora. However, it is not clear whether bitexts are an appropriate resourc...
Juri Ganitkevitch, Chris Callison-Burch, Courtney ...
LREC
2008
109views Education» more  LREC 2008»
13 years 7 months ago
Creating Sentence-Aligned Parallel Text Corpora from a Large Archive of Potential Parallel Text using BITS and Champollion
Parallel text is one of the most valuable resources for development of statistical machine translation systems and other NLP applications. The Linguistic Data Consortium (LDC) has...
Kazuaki Maeda, Xiaoyi Ma, Stephanie Strassel
ECIR
2008
Springer
13 years 7 months ago
Effects of Aligned Corpus Quality and Size in Corpus-Based CLIR
Aligned corpora are often-used resources in CLIR systems. The three qualities of translation corpora that most dramatically affect the performance of a corpus-based CLIR system are...
Tuomas Talvensaari