Sciweavers

ACL
2012
11 years 7 months ago
Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining
Long-span features, such as syntax, can improve language models for tasks such as speech recognition and machine translation. However, these language models can be difficult to u...
Ariya Rastrow, Mark Dredze, Sanjeev Khudanpur
CIKM
2011
Springer
12 years 5 months ago
Supervised language modeling for temporal resolution of texts
We investigate temporal resolution of documents, such as determining the date of publication of a story based on its text. We describe and evaluate a model that builds histograms e...
Abhimanu Kumar, Matthew Lease, Jason Baldridge
CIKM
2011
Springer
12 years 5 months ago
Focusing on novelty: a crawling strategy to build diverse language models
Word prediction performed by language models plays an important role in many tasks, e.g. word sense disambiguation, speech recognition, handwriting recognition, query spelling an...
Luciano Barbosa, Srinivas Bangalore
KCAP
2011
ACM
12 years 8 months ago
Language resources extracted from Wikipedia
Wikipedia provides an interesting amount of text for more than a hundred languages. This also includes languages where no reference corpora or other linguistic resources are easily ...
Denny Vrandecic, Philipp Sorg, Rudi Studer
ECIR
2011
Springer
12 years 8 months ago
Exploiting Thread Structures to Improve Smoothing of Language Models for Forum Post Retrieval
Due to many unique characteristics of forum data, forum post retrieval is different from traditional document retrieval and web search, raising interesting research questions abou...
Huizhong Duan, Chengxiang Zhai
ACL
2011
12 years 8 months ago
Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers
In this paper, with a belief that a language model that embraces a larger context provides better prediction ability, we present two extensions to standard n-gram language models ...
Deyi Xiong, Min Zhang, Haizhou Li
ICASSP
2011
IEEE
12 years 8 months ago
Exploiting sparseness of backing-off language models for efficient look-ahead in LVCSR
In this paper, we propose a new method for computing and applying language model look-ahead in a dynamic network decoder, exploiting the sparseness of backing-off n-gram language ...
David Nolden, Hermann Ney, Ralf Schlüter
TASLP
2010
12 years 11 months ago
Hierarchical Bayesian Language Models for Conversational Speech Recognition
Traditional n-gram language models are widely used in state-of-the-art large vocabulary speech recognition systems. This simple model suffers from some limitations, such as overfi...
Songfang Huang, Steve Renals
INTERSPEECH
2010
12 years 12 months ago
Recurrent neural network based language model
A new recurrent neural network based language model (RNN LM) with applications to speech recognition is presented. Results indicate that it is possible to obtain around 50% reduct...
Tomas Mikolov, Martin Karafiát, Lukas Burge...
ICDAR
2009
IEEE
13 years 2 months ago
Recurrent HMMs and Cursive Handwriting Recognition Graphs
Standard cursive handwriting recognition is based on a language model, mostly a lexicon of possible word hypotheses or character n-grams. The result is a list of word alternatives...
Marc-Peter Schambach