Sciweavers

» Mixing Multiple Translation Models in Statistical Machine Tr...
EMNLP
2008
14 years 11 months ago
Improved Sentence Alignment on Parallel Web Pages Using a Stochastic Tree Alignment Model
Parallel web pages are an important source of training data for statistical machine translation. In this paper, we present a new approach to sentence alignment on parallel web pages....
Lei Shi, Ming Zhou
FCCM
2008
IEEE
15 years 3 months ago
Map-reduce as a Programming Model for Custom Computing Machines
The map-reduce model requires users to express their problem in terms of a map function that processes single records in a stream, and a reduce function that merges all mapped out...
Jackson H. C. Yeung, C. C. Tsang, Kuen Hung Tsoi, ...
CIVR
2004
Springer
15 years 2 months ago
Using Maximum Entropy for Automatic Image Annotation
In this paper, we propose the use of the Maximum Entropy approach for the task of automatic image annotation. Given labeled training data, Maximum Entropy is a statistical techniqu...
Jiwoon Jeon, R. Manmatha
TASLP
2008
14 years 9 months ago
Syntactically Lexicalized Phrase-Based SMT
Until quite recently, extending Phrase-based Statistical Machine Translation (PBSMT) with syntactic knowledge caused system performance to deteriorate. The most recent su...
Hany Hassan, Khalil Sima'an, Andy Way
EMNLP
2009
14 years 7 months ago
Polylingual Topic Models
Topic models are a useful tool for analyzing large text collections, but have previously been applied in only monolingual, or at most bilingual, contexts. Meanwhile, massive colle...
David M. Mimno, Hanna M. Wallach, Jason Naradowsky...