Sciweavers

48 search results - page 5 / 10
» Unsupervised Tokenization for Machine Translation
Sort
View
ICDS
2007
IEEE
15 years 3 months ago
Automatic Acquisition of Translation Knowledge Using Structural Matching Between Parse Trees
— In this paper we present a rule-based formalism for the representation, acquisition, and application of translation knowledge. The formalism is being used successfully in a Jap...
Werner Winiwarter
ACL
2006
14 years 11 months ago
Unsupervised Analysis for Decipherment Problems
We study a number of natural language decipherment problems using unsupervised learning. These include letter substitution ciphers, character code conversion, phonetic deciphermen...
Kevin Knight, Anish Nair, Nishit Rathod, Kenji Yam...
EMNLP
2011
13 years 9 months ago
Quasi-Synchronous Phrase Dependency Grammars for Machine Translation
We present a quasi-synchronous dependency grammar (Smith and Eisner, 2006) for machine translation in which the leaves of the tree are phrases rather than words as in previous wor...
Kevin Gimpel, Noah A. Smith
ACL
2010
14 years 7 months ago
Discriminative Modeling of Extraction Sets for Machine Translation
We present a discriminative model that directly predicts which set of phrasal translation rules should be extracted from a sentence pair. Our model scores extraction sets: nested ...
John DeNero, Dan Klein
ACL
2008
14 years 11 months ago
Unsupervised Translation Induction for Chinese Abbreviations using Monolingual Corpora
Chinese abbreviations are widely used in modern Chinese texts. Compared with English abbreviations (which are mostly acronyms and truncations), the formation of Chinese abbreviati...
Zhifei Li, David Yarowsky