Sciweavers

EMNLP
2007
13 years 6 months ago
Improving Translation Quality by Discarding Most of the Phrasetable
It is possible to reduce the bulk of phrasetables for Statistical Machine Translation using a technique based on the significance testing of phrase pair co-occurrence in the para...
Howard Johnson, Joel D. Martin, George F. Foster, ...
LREC
2010
13 years 6 months ago
Achieving Domain Specificity in SMT without Overt Siloing
We examine pooling data as a method for improving Statistical Machine Translation (SMT) quality for narrowly defined domains, such as data for a particular company or public entit...
William D. Lewis, Chris Wendt, David Bullock
LREC
2010
13 years 6 months ago
Linguistically Motivated Unsupervised Segmentation for Machine Translation
In this paper we use statistical machine translation and morphology information from two different morphological analyzers to try to improve translation quality by linguistically ...
Mark Fishel, Harri Kirik
COLING
2008
13 years 6 months ago
Domain Adaptation for Statistical Machine Translation with Domain Dictionary and Monolingual Corpora
Statistical machine translation systems are usually trained on large amounts of bilingual text and monolingual text. In this paper, we propose a method to perform domain adapta...
Hua Wu, Haifeng Wang, Chengqing Zong
ACL
2008
13 years 6 months ago
Segmentation for English-to-Arabic Statistical Machine Translation
In this paper, we report on a set of initial results for English-to-Arabic Statistical Machine Translation (SMT). We show that morphological decomposition of the Arabic source is ...
Ibrahim Badr, Rabih Zbib, James R. Glass
ACL
2008
13 years 6 months ago
Exploiting N-best Hypotheses for SMT Self-Enhancement
Word and n-gram posterior probabilities estimated on N-best hypotheses have been used to improve the performance of statistical machine translation (SMT) in a rescoring framework....
Boxing Chen, Min Zhang, AiTi Aw, Haizhou Li
ACL
2008
13 years 6 months ago
Recent Improvements in the CMU Large Scale Chinese-English SMT System
In this paper we describe recent improvements to components and methods used in our statistical machine translation system for Chinese-English used in the January 2008 GALE evaluat...
Almut Silja Hildebrand, Kay Rottmann, Mohamed Noam...
ACL
2007
13 years 6 months ago
Extending MARIE: an N-gram-based SMT decoder
In this paper we present several extensions of MARIE, a freely available N-gram-based statistical machine translation (SMT) decoder. The extensions mainly consist of the ability...
Josep Maria Crego, José B. Mariño
ACL
2008
13 years 6 months ago
Measure Word Generation for English-Chinese SMT Systems
Measure words in Chinese are used to indicate the count of nouns. Conventional statistical machine translation (SMT) systems do not perform well on measure word generation due to ...
Dongdong Zhang, Mu Li, Nan Duan, Chi-Ho Li, Ming Z...
ACL
2008
13 years 6 months ago
Name Translation in Statistical Machine Translation - Learning When to Transliterate
We present a method to transliterate names in the framework of end-to-end statistical machine translation. The system is trained to learn when to transliterate. For Arabic to Engl...
Ulf Hermjakob, Kevin Knight, Hal Daumé III