Sciweavers

SIGIR
2011
ACM
12 years 7 months ago
When documents are very long, BM25 fails!
We reveal that the Okapi BM25 retrieval function tends to overly penalize very long documents. To address this problem, we present a simple yet effective extension of BM25, namel...
Yuanhua Lv, ChengXiang Zhai
JASIS
2010
13 years 3 months ago
Linear time series models for term weighting in information retrieval
Common measures of term importance in information retrieval (IR) rely on counts of term frequency; rare terms receive higher weight in document ranking than common terms receive. ...
Miles Efron
ECIR
2010
Springer
13 years 6 months ago
Semantically Enhanced Term Frequency
In this paper, we complement the term frequency, which is used in many bag-of-words based information retrieval models, with information about the semantic relatedness of query and...
Christof Müller, Iryna Gurevych
SPIRE
2007
Springer
13 years 11 months ago
Extending Weighting Models with a Term Quality Measure
Weighting models use lexical statistics, such as term frequencies, to derive term weights, which are used to estimate the relevance of a document to a query. Apart from t...
Christina Lioma, Iadh Ounis
ISDA
2008
IEEE
13 years 11 months ago
Compute the Term Contributed Frequency
In this paper, we propose an algorithm and data structure for computing the term contributed frequency (tcf) for all N-grams in a text corpus. Although term frequency is one of th...
Cheng-Lung Sung, Hsu-Chun Yen, Wen-Lian Hsu