Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

85

Voted

EMNLP
2008

favoriteEmaildiscussreport

75views Natural Language Processing» more EMNLP 2008»

Forest-based Translation Rule Extraction

15 years 15 days ago

Forest-based Translation Rule Extraction

Download www.cis.upenn.edu

Translation rule extraction is a fundamental problem in machine translation, especially for linguistically syntax-based systems that need parse trees from either or both sides of the bitext. The current dominant practice only uses 1-best trees, which adversely affects the rule set quality due to parsing errors. So we propose a novel approach which extracts rules from a packed forest that compactly encodes exponentially many parses. Experiments show that this method improves translation quality by over 1 BLEU point on a state-of-the-art tree-to-string system, and is 0.5 points better than (and twice as fast as) extracting on 30best parses. When combined with our previous work on forest-based decoding, it achieves a 2.5 BLEU points improvement over the baseline, and even outperforms the hierarchical system of Hiero by 0.7 points.

Haitao Mi, Liang Huang

Real-time Traffic

Bleu Points | EMNLP 2008 | Natural Language Processing | Rule Set Quality | Translation Rule Extraction |

claim paper

Related Content

» Akamon An Open Source Toolkit for TreeForestBased Statistical Machine Translation

» Effective Use of Function Words for Rule Generalization in ForestBased Translation

» FineGrained TreetoString Translation Rule Extraction

» Hierarchical PhraseBased Translation Grammars Extracted from Alignment Posterior Probabili...

» Discriminative Modeling of Extraction Sets for Machine Translation

» Learning Better Rule Extraction with Translation Span Alignment

» Better Filtration and Augmentation for Hierarchical PhraseBased Translation Rules

» An Empirical Study of Translation Rule Extraction with Multiple Parsers

» Automatic Extraction of Translational JapaneseKATAKANA and English Word Pairs

» Statistical Machine Translation with a Factorized Grammar

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	EMNLP
Authors	Haitao Mi, Liang Huang

Comments (0)