Search Sciweavers | Sciweavers

73 search results - page 1 / 15

» Compression-based document length prior for language models

click to vote

ECIR
2008
Springer

134views Information Technology» more ECIR 2008»

Probabilistic Document Length Priors for Language Models

13 years 6 months ago

Download www.dc.fi.udc.es

This paper addresses the issue of devising a new document prior for the language modeling (LM) approach for Information Retrieval. The prior is based on term statistics, derived in...

Roi Blanco, Alvaro Barreiro

claim paper

Read More »

click to vote

SIGIR
2009
ACM

123views Information Technology» more SIGIR 2009»

Compression-based document length prior for language models

13 years 11 months ago

Download www.dc.fi.udc.es

The inclusion of document length factors has been a major topic in the development of retrieval models. We believe that current models can be further improved by more reﬁned est...

Javier Parapar, David E. Losada, Alvaro Barreiro

claim paper

Read More »

click to vote

IR
2008

105views Natural Language Processing» more IR 2008»

An analysis on document length retrieval trends in language modeling smoothing

13 years 5 months ago

Download www.gsi.dec.usc.es

Abstract. Document length is widely recognized as an important factor for adjusting retrieval systems. Many models tend to favor the retrieval of either short or long documents and...

David E. Losada, Leif Azzopardi

claim paper

Read More »

click to vote

TREC
2004

127views Information Technology» more TREC 2004»

Language Models for Searching in Web Corpora

13 years 6 months ago

Download trec.nist.gov

: We describe our participation in the TREC 2004 Web and Terabyte tracks. For the web track, we employ mixture language models based on document full-text, incoming anchortext, and...

Jaap Kamps, Gilad Mishne, Maarten de Rijke

claim paper

Read More »

click to vote

ACL
2003

106views Computational Linguistics» more ACL 2003»

Unsupervised Segmentation of Words Using Prior Distributions of Morph Length and Frequency

13 years 6 months ago

Download www.aclweb.org

We present a language-independent and unsupervised algorithm for the segmentation of words into morphs. The algorithm is based on a new generative probabilistic model, which makes...

Mathias Creutz

claim paper

Read More »

« Prev « First page 1 / 15 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers