Sciweavers

48 search results - page 1 / 10
EMNLP
2009
13 years 2 months ago
Unsupervised Tokenization for Machine Translation
Training a statistical machine translation starts with tokenizing a parallel corpus. Some languages such as Chinese do not incorporate spacing in their writing system, which creat...
Tagyoung Chung, Daniel Gildea
COLING
2010
12 years 12 months ago
Unsupervised cleansing of noisy text
In this paper we look at the problem of cleansing noisy text using a statistical machine translation model. Noisy text is produced in informal communications such as Short Message...
Danish Contractor, Tanveer A. Faruquie, L. Venkata...
AUSAI
2006
Springer
13 years 8 months ago
Unsupervised Measurement of Translation Quality Using Multi-engine, Bi-directional Translation
Lay people discussing machine translation systems often perform a round trip translation, that is translating a text into a foreign language and back, to measure the qual...
Menno van Zaanen, Simon Zwarts
COLING
2010
12 years 12 months ago
Contextual Modeling for Meeting Translation Using Unsupervised Word Sense Disambiguation
In this paper we investigate the challenges of applying statistical machine translation to meeting conversations, with a particular view towards analyzing the importance of modeli...
Mei Yang, Katrin Kirchhoff
ACL
2006
13 years 6 months ago
Weakly Supervised Named Entity Transliteration and Discovery from Multilingual Comparable Corpora
Named Entity recognition (NER) is an important part of many natural language processing tasks. Current approaches often employ machine learning techniques and require supervised d...
Alexandre Klementiev, Dan Roth