An Online Relevant Set Algorithm for Statistical Machine Translation

11 years 6 months ago
An Online Relevant Set Algorithm for Statistical Machine Translation
This paper presents a novel online relevant set algorithm for a linearly-scored block sequence translation model. The key component is a new procedure to directly optimize the global scoring function used by a statistical machine translation (SMT) decoder. This training procedure treats the decoder as a black-box, and thus can be used to optimize any decoding scheme. The novel algorithm is evaluated using different feature types: 1) commonly used probabilistic features, such as translation, language, or distortion model probabilities, and 2) binary features. In particular, encouraging results on a standard ArabicEnglish translation task are presented for a translation system that uses only binary feature functions. To further demonstrate the effectiveness of the novel training algorithm, a detailed comparison with the widely used minimum-error-rate (MER) training algorithm [2] is presented using the same decoder and feature set. The online algorithm is simplified by introducing so-call...
Christoph Tillmann, Tong Zhang
Added 15 Dec 2010
Updated 15 Dec 2010
Type Journal
Year 2008
Authors Christoph Tillmann, Tong Zhang
Comments (0)