Sciweavers

82 search results - page 12 / 17
» Learning Bilingual Lexicons from Monolingual Corpora
Sort
View
MT
2007
100views more  MT 2007»
14 years 9 months ago
Semi-supervised model adaptation for statistical machine translation
Statistical machine translation systems are usually trained on large amounts of bilingual text (used to learn a translation model), and also large amounts of monolingual text in th...
Nicola Ueffing, Gholamreza Haffari, Anoop Sarkar
ACL
2006
14 years 11 months ago
Semi-Supervised Learning of Partial Cognates Using Bilingual Bootstrapping
Partial cognates are pairs of words in two languages that have the same meaning in some, but not all contexts. Detecting the actual meaning of a partial cognate in context can be ...
Oana Frunza, Diana Zaiu Inkpen
EACL
2003
ACL Anthology
14 years 11 months ago
Empirical Methods for Compound Splitting
Compounded words are a challenge for NLP applications such as machine translation (MT). We introduce methods to learn splitting rules from monolingual and parallel corpora. We eva...
Philipp Koehn, Kevin Knight
96
Voted
LREC
2010
199views Education» more  LREC 2010»
14 years 11 months ago
Building a Cross-lingual Relatedness Thesaurus using a Graph Similarity Measure
The Internet is an ever growing source of information stored in documents of different languages. Hence, cross-lingual resources are needed for more and more NLP applications. Thi...
Lukas Michelbacher, Florian Laws, Beate Dorow, Ulr...
CORR
1999
Springer
81views Education» more  CORR 1999»
14 years 9 months ago
Inducing a Semantically Annotated Lexicon via EM-Based Clustering
We present a technique for automatic induction of slot annotations for subcategorization frames, based on induction of hidden classes in the EM framework of statistical estimation...
Mats Rooth, Stefan Riezler, Detlef Prescher, Glenn...