Sciweavers

136 search results - page 1 / 28
ACL
1994
13 years 6 months ago
Automatic Alignment in Parallel Corpora
This paper addresses the alignment issue in the framework of exploitation of large bi-/multilingual corpora for translation purposes. A generic alignment scheme is proposed that can...
Harris Papageorgiou, Lambros Cranias, Stelios Pipe...
ICCPOL
2009
Springer
13 years 9 months ago
Constructing Parallel Corpus from Movie Subtitles
This paper describes a methodology for constructing aligned German-Chinese corpora from movie subtitles. The corpora will be used to train a special machine translation s...
Han Xiao, Xiaojie Wang
LREC
2008
13 years 6 months ago
Experiments on Processing Overlapping Parallel Corpora
The number and sizes of parallel corpora keep growing, which makes it necessary to have automatic methods of processing them: combining, checking and improving corpora quality, et...
Mark Fishel, Heiki Jaan Kaalep
NAACL
2010
13 years 3 months ago
Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment
The quality of a statistical machine translation (SMT) system is heavily dependent upon the amount of parallel sentences used in training. In recent years, there have been several...
Jason R. Smith, Chris Quirk, Kristina Toutanova
COLING
2010
13 years 2 days ago
Extraction of Multi-word Expressions from Small Parallel Corpora
We present a general methodology for extracting multi-word expressions (of various types), along with their translations, from small parallel corpora. We automatically align the p...
Yulia Tsvetkov, Shuly Wintner