Sciweavers

» Space-Efficient Algorithms for Document Retrieval
CIKM
2009
ACM
15 years 6 months ago
Compact full-text indexing of versioned document collections
We study the problem of creating highly compressed full-text index structures for versioned document collections, that is, collections that contain multiple versions of each docume...
Jinru He, Hao Yan, Torsten Suel
SAC
2009
ACM
15 years 6 months ago
Combining statistics and semantics via ensemble model for document clustering
Incorporating background knowledge into data mining algorithms is an important but challenging problem. Current approaches in semi-supervised learning require explicit knowledge p...
Samah Jamal Fodeh, William F. Punch, Pang-Ning Tan
SIGIR
2009
ACM
15 years 6 months ago
SUSHI: scoring scaled samples for server selection
Modern techniques for distributed information retrieval use a set of documents sampled from each server, but these samples have been underutilised in server selection. We describe...
Paul Thomas, Milad Shokouhi
ECIR
2010
Springer
15 years 1 month ago
Text Clustering for Peer-to-Peer Networks with Probabilistic Guarantees
Text clustering is an established technique for improving quality in information retrieval, for both centralized and distributed environments. However, for highly distributed envir...
Odysseas Papapetrou, Wolf Siberski, Norbert Fuhr
CIKM
2007
ACM
15 years 6 months ago
A knowledge-based search engine powered by Wikipedia
This paper describes Koru, a new search interface that offers effective domain-independent knowledge-based information retrieval. Koru exhibits an understanding of the topics of b...
David N. Milne, Ian H. Witten, David M. Nichols