Sciweavers

1255 search results - page 209 / 251
» Information-Based Machine Translation
EMNLP
2009
14 years 10 months ago
Polylingual Topic Models
Topic models are a useful tool for analyzing large text collections, but have previously been applied in only monolingual, or at most bilingual, contexts. Meanwhile, massive colle...
David M. Mimno, Hanna M. Wallach, Jason Naradowsky...
COLING
2010
14 years 7 months ago
EMDC: A Semi-supervised Approach for Word Alignment
This paper proposes a novel semisupervised word alignment technique called EMDC that integrates discriminative and generative methods. A discriminative aligner is used to find hig...
Qin Gao, Francisco Guzmán, Stephan Vogel
COLING
2010
14 years 7 months ago
Leveraging Multiple MT Engines for Paraphrase Generation
This paper proposes a method that leverages multiple machine translation (MT) engines for paraphrase generation (PG). The method includes two stages. Firstly, we use a multi-pivot...
Shiqi Zhao, Haifeng Wang, Xiang Lan, Ting Liu
INTERSPEECH
2010
14 years 7 months ago
A spoken term detection framework for recovering out-of-vocabulary words using the web
Vocabulary restrictions in large vocabulary continuous speech recognition (LVCSR) systems mean that out-of-vocabulary (OOV) words are lost in the output. However, OOV words tend t...
Carolina Parada, Abhinav Sethy, Mark Dredze, Frede...
ICASSP
2011
IEEE
14 years 4 months ago
The IBM 2009 GALE Arabic speech transcription system
We describe the Arabic broadcast transcription system fielded by IBM in the GALE Phase 4 machine translation evaluation. Key advances over our Phase 3.5 system include improvements ...
Brian Kingsbury, Hagen Soltau, George Saon, Stephe...