Sciweavers

133 search results - page 2 / 27
» Joint Tokenization and Translation
Sort
View
AAAI
2000
14 years 11 months ago
Estimating Word Translation Probabilities from Unrelated Monolingual Corpora Using the EM Algorithm
Selecting the right word translation among several options in the lexicon is a core problem for machine translation. We present a novel approach to this problem that can be traine...
Philipp Koehn, Kevin Knight
CICLING
2009
Springer
15 years 10 months ago
Enriching Statistical Translation Models Using a Domain-Independent Multilingual Lexical Knowledge Base
This paper presents a method for improving phrase-based Statistical Machine Translation systems by enriching the original translation model with information derived from a multilin...
Miguel García, Jesús Giménez,...
COLING
2010
14 years 4 months ago
Joint Parsing and Translation
Tree-based translation models, which exploit the linguistic syntax of source language, usually separate decoding into two steps: parsing and translation. Although this separation ...
Yang Liu, Qun Liu
78
Voted
CORR
1998
Springer
99views Education» more  CORR 1998»
14 years 9 months ago
Models of Co-occurrence
A model of co-occurrence in bitext is a boolean predicate that indicates whether a given pair of word tokens co-occur in corresponding regions of the bitext space. Co-occurrence i...
I. Dan Melamed
COLING
2010
14 years 4 months ago
Unsupervised cleansing of noisy text
In this paper we look at the problem of cleansing noisy text using a statistical machine translation model. Noisy text is produced in informal communications such as Short Message...
Danish Contractor, Tanveer A. Faruquie, L. Venkata...