Sciweavers

ECIR
2006
Springer
13 years 5 months ago
An Efficient Computation of the Multiple-Bernoulli Language Model
Abstract. The Multiple Bernoulli (MB) Language Model has been generally considered too computationally expensive for practical purposes and superseded by the more efficient multino...
Leif Azzopardi, David E. Losada
ACL
2006
13 years 5 months ago
Discriminative Pruning of Language Models for Chinese Word Segmentation
This paper presents a discriminative pruning method of n-gram language model for Chinese word segmentation. To reduce the size of the language model that is used in a Chinese word...
Jianfeng Li, Haifeng Wang, Dengjun Ren, Guohua Li
ACL
2006
13 years 5 months ago
Reduced n-gram Models for English and Chinese Corpora
Statistical language models should improve as the size of the n-grams increases from 3 to 5 or higher. However, the number of parameters and calculations, and the storage requirem...
Le Quan Ha, Philip Hanna, Darryl Stewart, F. Jack ...
ACL
2004
13 years 5 months ago
Head-Driven Parsing for Word Lattices
We present the first application of the head-driven statistical parsing model of Collins (1999) as a simultaneous language model and parser for large-vocabulary speech recognition....
Christopher Collins, Bob Carpenter, Gerald Penn
NAACL
2007
13 years 6 months ago
Language Modeling for Determiner Selection
We present a method for automatic determiner selection, based on an existing language model. We train on the Penn Treebank and also use additional data from the North American New...
Jenine Turner, Eugene Charniak
NAACL
2007
13 years 6 months ago
Joint Morphological-Lexical Language Modeling for Machine Translation
We present a joint morphological-lexical language model (JMLLM) for use in statistical machine translation (SMT) of language pairs where one or both of the languages are morpholog...
Ruhi Sarikaya, Yonggang Deng
NIPS
2008
13 years 6 months ago
A Scalable Hierarchical Distributed Language Model
Neural probabilistic language models (NPLMs) have been shown to be competitive with and occasionally superior to the widely-used n-gram language models. The main drawback of NPLMs...
Andriy Mnih, Geoffrey E. Hinton
LREC
2008
13 years 6 months ago
On the Use of Web Resources and Natural Language Processing Techniques to Improve Automatic Speech Recognition Systems
Language models used in current automatic speech recognition systems are trained on general-purpose corpora and are therefore not relevant to transcribe spoken documents dealing w...
Gwénolé Lecorvé, Guillaume Gr...
LREC
2008
13 years 6 months ago
A Comparative Study on Language Identification Methods
In this paper we present two experiments conducted for comparison of different language identification algorithms. Short words-, frequent words- and n-gram-based approaches are co...
Lena Grothe, Ernesto William De Luca, Andreas Nü...
ECIR
2008
Springer
13 years 6 months ago
Optimizing Language Models for Polarity Classification
Abstract. This paper investigates the usage of various types of language models on polarity text classification
Michael Wiegand, Dietrich Klakow