Sciweavers

54 search results - page 2 / 11
» Capturing collection size for distributed non-cooperative re...
Sort
View
ECIR
2009
Springer
14 years 2 months ago
A Topic-Based Measure of Resource Description Quality for Distributed Information Retrieval
The aim of query-based sampling is to obtain a sufficient, representative sample of an underlying (text) collection. Current measures for assessing sample quality are too coarse gr...
Mark Baillie, Mark James Carman, Fabio Crestani
SIGIR
2008
ACM
13 years 4 months ago
Generalising multiple capture-recapture to non-uniform sample sizes
Algorithms in distributed information retrieval often rely on accurate knowledge of the size of a collection. The "multiple capture-recapture" method of Shokouhi et al. ...
Paul Thomas
SIGIR
2010
ACM
12 years 11 months ago
The 8th workshop on large-scale distributed systems for information retrieval (LSDS-IR'10)
The size of the Web as well as user bases of search systems continue to grow exponentially. Consequently, providing subsecond query response times and high query throughput become...
Roi Blanco, Berkant Barla Cambazoglu, Claudio Lucc...
ERCIMDL
2005
Springer
114views Education» more  ERCIMDL 2005»
13 years 10 months ago
Compressing Dynamic Text Collections via Phrase-Based Coding
We present a new statistical compression method, which we call Phrase Based Dense Code (PBDC), aimed at compressing large digital libraries. PBDC compresses the text collection to ...
Nieves R. Brisaboa, Antonio Fariña, Gonzalo...
IPM
2007
123views more  IPM 2007»
13 years 4 months ago
Using query logs to establish vocabularies in distributed information retrieval
Users of search engines express their needs as queries, typically consisting of a small number of terms. The resulting search engine query logs are valuable resources that can be ...
Milad Shokouhi, Justin Zobel, Seyed M. M. Tahaghog...