Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

8

LREC
2010

favoriteEmaildiscussreport

181views Education» more LREC 2010»

Linguistically Motivated Unsupervised Segmentation for Machine Translation

13 years 6 months ago

Linguistically Motivated Unsupervised Segmentation for Machine Translation

Download www.lrec-conf.org

In this paper we use statistical machine translation and morphology information from two different morphological analyzers to try to improve translation quality by linguistically motivated segmentation. The morphological analyzers we use are the unsupervised Morfessor morpheme segmentation and analyzer toolkit and the rule-based morphological analyzer T3. Our translations are done using the Moses statistical machine translation toolkit with training on the JRC-Acquis corpora and translating on Estonian to English and English to Estonian language directions. In our work we model such linguistic phenomena as word lemmas and endings and splitting compound words into simpler parts. Also lemma information was used to introduce new factors to the corpora and to use this information for better word alignment or for alternative path back-off translation. From the results we find that even though these methods have shown previously and keep showing promise of improved translation, their succes...

Mark Fishel, Harri Kirik

Real-time Traffic

Education | LREC 2010 | Morphological Analyzer | Rule-based Morphological Analyzer | Statistical Machine Translation |

claim paper

Related Content

» Combining Morphemebased Machine Translation with Postprocessing Morpheme Prediction

» Nonparametric Word Segmentation for Machine Translation

» Unsupervised Search for the Optimal Segmentation for Statistical Machine Translation

» Bilingually Motivated DomainAdapted Word Segmentation for Statistical Machine Translation

» Contextual Modeling for Meeting Translation Using Unsupervised Word Sense Disambiguation

» Enhancing Statistical Machine Translation with Character Alignment

» Unsupervised Discriminative Language Model Training for Machine Translation using Simulate...

» Linguistically Annotated BTG for Statistical Machine Translation

» Unsupervised cleansing of noisy text

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	LREC
Authors	Mark Fishel, Harri Kirik

Comments (0)