Sciweavers

COLING
2010
12 years 11 months ago
Fast-Champollion: A Fast and Robust Sentence Alignment Algorithm
Sentence-level aligned parallel texts are important resources for a number of natural language processing (NLP) tasks and applications such as statistical machine translation and ...
Peng Li, Maosong Sun, Ping Xue
ANLP
2000
163views more  ANLP 2000»
13 years 5 months ago
Automatic construction of parallel English-Chinese corpus for cross-language information retrieval
A major obstacle to the construction of a probabilistic translation model is the lack of large parallel corpora. In this paper we first describe a parallel text mining system that...
Jiang Chen, Jian-Yun Nie
LREC
2008
132views Education» more  LREC 2008»
13 years 6 months ago
Babylon Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages
This paper describes BABYLON, a system that attempts to overcome the shortage of parallel texts in low-density languages by supplementing existing parallel texts with texts gather...
Michael Mohler, Rada Mihalcea