
ACL
2010


Error Detection for Statistical Machine Translation Using Linguistic Features


Automatic error detection is desirable as a post-processing step for improving machine translation quality. Previous work is largely based on confidence estimation using system-based features, such as word posterior probabilities calculated from N-best lists or word lattices. We propose to incorporate two groups of linguistic features, which convey information from outside the machine translation system, into error detection: lexical features and syntactic features. We use a maximum entropy classifier to predict translation errors by integrating a word posterior probability feature with the linguistic features. The experimental results show that 1) linguistic features alone outperform word posterior probability based confidence estimation in error detection; and 2) linguistic features provide further complementary information when combined with word confidence scores, and the combination reduces the classification error rate by 18.52% and improves the F measure by 16.37%.
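As a rough illustration of the pipeline the abstract describes, the sketch below computes a simplified word posterior probability from an N-best list and feeds it, together with one linguistic feature, into a binary maximum entropy (logistic regression) classifier. It is not the authors' implementation. The position-independent posterior, the `is_unlinked` feature, the function names, and all toy data are assumptions made for this example only.

```python
import math
from collections import defaultdict


def word_posteriors(nbest):
    """Simplified word posterior probabilities from an N-best list.

    nbest: list of (tokens, log_score) pairs.
    Assumption: position-independent posteriors, i.e. a word's posterior is
    the normalized probability mass of all hypotheses that contain it.
    """
    max_log = max(score for _, score in nbest)
    # Subtract the maximum before exponentiating for numerical stability.
    weights = [math.exp(score - max_log) for _, score in nbest]
    total = sum(weights)
    posteriors = defaultdict(float)
    for (tokens, _), weight in zip(nbest, weights):
        for word in set(tokens):  # count each word once per hypothesis
            posteriors[word] += weight / total
    return dict(posteriors)


def predict_proba(w, x):
    """P(word is an error | features x); the last entry of w is the bias."""
    z = w[-1] + sum(wj * v for wj, v in zip(w, x))
    return 1.0 / (1.0 + math.exp(-z))


def train_maxent(X, y, epochs=5000, lr=0.5):
    """Binary maxent classifier trained by batch gradient ascent
    on the mean log-likelihood (no regularization)."""
    w = [0.0] * (len(X[0]) + 1)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for x, label in zip(X, y):
            err = label - predict_proba(w, x)
            for j, v in enumerate(x):
                grad[j] += err * v
            grad[-1] += err
        w = [wj + lr * g / len(X) for wj, g in zip(w, grad)]
    return w


# Hypothetical N-best list: (tokens, model log-score).
nbest = [
    (["the", "cat", "sat"], -1.0),
    (["the", "cat", "sits"], -1.0),
    (["a", "cat", "sat"], -2.0),
]
wpp = word_posteriors(nbest)

# Hypothetical training data. Each row is [wpp, is_unlinked], where
# is_unlinked stands in for a syntactic feature; label 1 marks an error.
X = [[0.9, 0], [0.8, 0], [0.85, 1], [0.3, 0], [0.2, 1], [0.4, 1]]
y = [0, 0, 1, 1, 1, 1]
weights = train_maxent(X, y)
predictions = [int(predict_proba(weights, x) > 0.5) for x in X]
```

In this toy data the third row has a high posterior but is still labeled an error, so only the linguistic feature can flag it. That is the kind of complementary signal the abstract attributes to combining linguistic features with word confidence scores.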

Deyi Xiong, Min Zhang, Haizhou Li

ACL 2010 | Computational Linguistics | Confidence Estimation | Linguistic Features | Word Posterior Probability

Related Content

» Effective Use of Linguistic and Contextual Information for Statistical Machine Translation

» Contextual Modeling for Meeting Translation Using Unsupervised Word Sense Disambiguation

» Bilingual Sense Similarity for Statistical Machine Translation

» Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation

» Lattice-based Minimum Error Rate Training for Statistical Machine Translation

» Random Restarts in Minimum Error Rate Training for Statistical Machine Translation

» Akamon: An Open Source Toolkit for Tree/Forest-Based Statistical Machine Translation

» Minimum Bayes-Risk Decoding for Statistical Machine Translation

» Improved Models of Distortion Cost for Statistical Machine Translation

Post Info

Added	10 Feb 2011
Updated	10 Feb 2011
Type	Journal
Year	2010
Where	ACL
Authors	Deyi Xiong, Min Zhang, Haizhou Li