Sciweavers

ACL
2010
14 years 7 months ago
Learning Common Grammar from Multilingual Corpus
We propose a corpus-based probabilistic framework to extract hidden common syntax across languages from non-parallel multilingual corpora in an unsupervised fashion. For this purp...
Tomoharu Iwata, Daichi Mochihashi, Hiroshi Sawada
CLEF
2010
Springer
14 years 9 months ago
Creating a Persian-English Comparable Corpus
Multilingual corpora are valuable resources for cross-language information retrieval and are available in many language pairs. However the Persian language does not have rich multi...
Homa Baradaran Hashemi, Azadeh Shakery, Heshaam Fe...
IJCAI
2007
14 years 10 months ago
Pseudo-Aligned Multilingual Corpora
In machine translation, document alignment refers to finding correspondences between documents which are exact translations of each other. We define pseudo-alignment as the task...
Fernando Diaz, Donald Metzler