Sciweavers

73 search results - page 1 / 15
» Compression-based document length prior for language models
Sort
View
ECIR
2008
Springer
13 years 6 months ago
Probabilistic Document Length Priors for Language Models
This paper addresses the issue of devising a new document prior for the language modeling (LM) approach for Information Retrieval. The prior is based on term statistics, derived in...
Roi Blanco, Alvaro Barreiro
SIGIR
2009
ACM
13 years 11 months ago
Compression-based document length prior for language models
The inclusion of document length factors has been a major topic in the development of retrieval models. We believe that current models can be further improved by more refined est...
Javier Parapar, David E. Losada, Alvaro Barreiro
IR
2008
13 years 5 months ago
An analysis on document length retrieval trends in language modeling smoothing
Abstract. Document length is widely recognized as an important factor for adjusting retrieval systems. Many models tend to favor the retrieval of either short or long documents and...
David E. Losada, Leif Azzopardi
TREC
2004
13 years 6 months ago
Language Models for Searching in Web Corpora
: We describe our participation in the TREC 2004 Web and Terabyte tracks. For the web track, we employ mixture language models based on document full-text, incoming anchortext, and...
Jaap Kamps, Gilad Mishne, Maarten de Rijke
ACL
2003
13 years 6 months ago
Unsupervised Segmentation of Words Using Prior Distributions of Morph Length and Frequency
We present a language-independent and unsupervised algorithm for the segmentation of words into morphs. The algorithm is based on a new generative probabilistic model, which makes...
Mathias Creutz