Sciweavers

ACL
2012
11 years 7 months ago
DOMCAT: A Bilingual Concordancer for Domain-Specific Computer Assisted Translation
In this paper, we propose a web-based bilingual concordancer, DOMCAT, for domain-specific computer assisted translation. Given a multi-word expression as a query, the system in...
Ming-Hong Bai, Yu-Ming Hsieh, Keh-Jiann Chen, Jaso...
ACL
2011
12 years 8 months ago
An Unsupervised Model for Joint Phrase Alignment and Extraction
We present an unsupervised model for joint phrase alignment and extraction using nonparametric Bayesian methods and inversion transduction grammars (ITGs). The key contribution is...
Graham Neubig, Taro Watanabe, Eiichiro Sumita, Shi...
ACL
2006
13 years 5 months ago
Extracting Parallel Sub-Sentential Fragments from Non-Parallel Corpora
We present a novel method for extracting parallel sub-sentential fragments from comparable, non-parallel bilingual corpora. By analyzing potentially similar sentence pairs using a...
Dragos Stefan Munteanu, Daniel Marcu
LREC
2010
13 years 6 months ago
Belgisch Staatsblad Corpus: Retrieving French-Dutch Sentences from Official Documents
We describe the compilation of a large corpus of French-Dutch sentence pairs from official Belgian documents which are available in the online version of the publication Belgisch ...
Tom Vanallemeersch
AIRS
2004
Springer
13 years 8 months ago
Combining Sentence Length with Location Information to Align Monolingual Parallel Texts
Abundant Chinese paraphrasing resources can be obtained on the Internet from different Chinese translations of a single foreign masterpiece. A paraphrase corpus is a corpus that includes s...
Weigang Li, Ting Liu, Sheng Li