Sciweavers

INTERSPEECH
2010
12 years 10 months ago
Learning a language model from continuous speech
This paper presents a new approach to language model construction, learning a language model not from text, but directly from continuous speech. A phoneme lattice is created using...
Graham Neubig, Masato Mimura, Shinsuke Mori, Tatsu...
COLING
2010
12 years 10 months ago
Improving Reordering with Linguistically Informed Bilingual n-grams
We present a new reordering model estimated as a standard n-gram language model with units built from morphosyntactic information of the source and target languages. It can be see...
Josep Maria Crego, François Yvon
EMNLP
2009
13 years 1 months ago
Using the Web for Language Independent Spellchecking and Autocorrection
We have designed, implemented and evaluated an end-to-end spellchecking and autocorrection system that does not require any manually annotated training data. The World Wide...
Casey Whitelaw, Ben Hutchinson, Grace Chung, Ged E...
ACL
2009
13 years 1 months ago
Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation
We describe Joshua (Li et al., 2009a), an open source toolkit for statistical machine translation. Joshua implements all of the algorithms required for translation via synchronou...
Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri ...
ACL
2006
13 years 5 months ago
Distortion Models for Statistical Machine Translation
In this paper, we argue that n-gram language models are not sufficient to address word reordering required for Machine Translation. We propose a new distortion model that can be u...
Yaser Al-Onaizan, Kishore Papineni
ACL
2004
13 years 5 months ago
Head-Driven Parsing for Word Lattices
We present the first application of the head-driven statistical parsing model of Collins (1999) as a simultaneous language model and parser for large-vocabulary speech recognition....
Christopher Collins, Bob Carpenter, Gerald Penn
COLING
2008
13 years 5 months ago
Translating Queries into Snippets for Improved Query Expansion
User logs of search engines have recently been applied successfully to improve various aspects of web search quality. In this paper, we will apply pairs of user queries and snippe...
Stefan Riezler, Yi Liu, Alexander Vasserman
ACL
2008
13 years 5 months ago
A New String-to-Dependency Machine Translation Algorithm with a Target Dependency Language Model
In this paper, we propose a novel string-to-dependency algorithm for statistical machine translation. With this new framework, we employ a target dependency language model during d...
Libin Shen, Jinxi Xu, Ralph M. Weischedel
CSE
2009
IEEE
13 years 10 months ago
A Language of Life: Characterizing People Using Cell Phone Tracks
Mobile devices can produce continuous streams of data which are often specific to the person carrying them. We show that cell phone tracks from the MIT Reality dataset can be u...
Alexy Khrabrov, George Cybenko