Sciweavers

» Discriminative n-gram language modeling
CICLING
2010
Springer
13 years 8 months ago
Word Length n-Grams for Text Re-use Detection
The automatic detection of shared content in written documents (which includes text reuse and its unacknowledged commitment, plagiarism) has become an important probl...
Alberto Barrón-Cedeño, Chiara Basile...
COLING
2010
12 years 11 months ago
Unsupervised Discriminative Language Model Training for Machine Translation using Simulated Confusion Sets
An unsupervised discriminative training procedure is proposed for estimating a language model (LM) for machine translation (MT). An English-to-English synchronous context-free gra...
Zhifei Li, Ziyuan Wang, Sanjeev Khudanpur, Jason E...
INTERSPEECH
2010
12 years 11 months ago
Improved language recognition using mixture components statistics
One successful approach to language recognition is to focus on the most discriminative high level features of languages, such as phones and words. In this paper, we applied a simi...
Abualsoud Hanani, Michael J. Carey 0002, Martin J....
EMNLP
2009
13 years 2 months ago
Extending Statistical Machine Translation with Discriminative and Trigger-Based Lexicon Models
In this work, we propose two extensions of standard word lexicons in statistical machine translation: A discriminative word lexicon that uses sentence-level source information to ...
Arne Mauser, Sasa Hasan, Hermann Ney