Sciweavers

593 search results - page 19 / 119
» Chunk-Based Statistical Translation
Sort
View
EMNLP
2009
14 years 11 months ago
Unsupervised Tokenization for Machine Translation
Training a statistical machine translation starts with tokenizing a parallel corpus. Some languages such as Chinese do not incorporate spacing in their writing system, which creat...
Tagyoung Chung, Daniel Gildea
89
Voted
EMNLP
2009
14 years 11 months ago
A Syntactified Direct Translation Model with Linear-time Decoding
Recent syntactic extensions of statistical translation models work with a synchronous context-free or tree-substitution grammar extracted from an automatically parsed parallel cor...
Hany Hassan, Khalil Sima'an, Andy Way
ACL
2012
13 years 4 months ago
Prediction of Learning Curves in Machine Translation
Parallel data in the domain of interest is the key resource when training a statistical machine translation (SMT) system for a specific purpose. Since ad-hoc manual translation c...
Prasanth Kolachina, Nicola Cancedda, Marc Dymetman...
112
Voted
COLING
2010
14 years 9 months ago
Learning Phrase Boundaries for Hierarchical Phrase-based Translation
Hierarchical phrase-based models provide a powerful mechanism to capture non-local phrase reorderings for statistical machine translation (SMT). However, many phrase reorderings a...
Zhongjun He, Yao Meng, Hao Yu
113
Voted
ACL
2012
13 years 4 months ago
NiuTrans: An Open Source Toolkit for Phrase-based and Syntax-based Machine Translation
We present a new open source toolkit for phrase-based and syntax-based machine translation. The toolkit supports several state-of-the-art models developed in statistical machine t...
Tong Xiao, Jingbo Zhu, Hao Zhang, Qiang Li