Sciweavers

ICCPOL
2009
Springer
13 years 9 months ago
Constructing Parallel Corpus from Movie Subtitles
This paper describes a methodology for constructing aligned German-Chinese corpora from movie subtitles. The corpora will be used to train a special machine translation s...
Han Xiao, Xiaojie Wang
LREC
2008
13 years 6 months ago
Using Movie Subtitles for Creating a Large-Scale Bilingual Corpora
This paper presents a method for compiling a large-scale bilingual corpus from a database of movie subtitles. To create the corpus, we propose an algorithm based on Gale and Churc...
Einav Itamar, Alon Itai
LREC
2008
13 years 6 months ago
Synchronizing Translated Movie Subtitles
This paper addresses the problem of synchronizing movie subtitles, which is necessary to improve alignment quality when building a parallel corpus out of translated subtitles. In ...
Jörg Tiedemann
LREC
2010
13 years 6 months ago
Evaluating Utility of Data Sources in a Large Parallel Czech-English Corpus CzEng 0.9
CzEng 0.9 is the third release of a large parallel corpus of Czech and English. For the current release, CzEng was extended by significant amount of texts from various types of so...
Ondřej Bojar, Adam Liška, Zdeněk Žabokrtský
LREC
2010
13 years 3 months ago
Creating a Reusable English-Chinese Parallel Corpus for Bilingual Dictionary Construction
This paper first describes an experiment to construct an English-Chinese parallel corpus, then applying the Uplug word alignment tool on the corpus and finally produce and evaluat...
Hercules Dalianis, Hao-chun Xing, Xin Zhang