Sciweavers

182 search results - page 1 / 37
» Probabilistic Document Length Priors for Language Models
Sort
View
ECIR
2008
Springer
13 years 6 months ago
Probabilistic Document Length Priors for Language Models
This paper addresses the issue of devising a new document prior for the language modeling (LM) approach for Information Retrieval. The prior is based on term statistics, derived in...
Roi Blanco, Alvaro Barreiro
SIGIR
2009
ACM
13 years 11 months ago
Compression-based document length prior for language models
The inclusion of document length factors has been a major topic in the development of retrieval models. We believe that current models can be further improved by more refined est...
Javier Parapar, David E. Losada, Alvaro Barreiro
ACL
2003
13 years 5 months ago
Unsupervised Segmentation of Words Using Prior Distributions of Morph Length and Frequency
We present a language-independent and unsupervised algorithm for the segmentation of words into morphs. The algorithm is based on a new generative probabilistic model, which makes...
Mathias Creutz
IR
2008
13 years 4 months ago
An analysis on document length retrieval trends in language modeling smoothing
Abstract. Document length is widely recognized as an important factor for adjusting retrieval systems. Many models tend to favor the retrieval of either short or long documents and...
David E. Losada, Leif Azzopardi
UAI
2008
13 years 6 months ago
The Phylogenetic Indian Buffet Process: A Non-Exchangeable Nonparametric Prior for Latent Features
Nonparametric Bayesian models are often based on the assumption that the objects being modeled are exchangeable. While appropriate in some applications (e.g., bag-ofwords models f...
Kurt T. Miller, Thomas L. Griffiths, Michael I. Jo...