Sciweavers

51 search results - page 4 / 11
» Improved index compression techniques for versioned document...
WWW
2007
ACM
15 years 10 months ago
Using d-gap patterns for index compression
Sequential patterns of d-gaps exist pervasively in inverted lists of Web document collection indices due to the cluster property. In this paper the information of d-gap sequential...
Jinlin Chen, Terry Cook
IPM
2007
14 years 9 months ago
Using structural contexts to compress semistructured text collections
We describe a compression model for semistructured documents, called Structural Contexts Model (SCM), which takes advantage of the context information usually implicit in the stru...
Joaquín Adiego, Gonzalo Navarro, Pablo de l...
SIGIR
2011
ACM
14 years 8 days ago
Inverted indexes for phrases and strings
Inverted indexes are the most fundamental and widely used data structures in information retrieval. For each unique word occurring in a document collection, the inverted index sto...
Manish Patil, Sharma V. Thankachan, Rahul Shah, Wi...
IJDAR
2007
14 years 9 months ago
Word matching using single closed contours for indexing handwritten historical documents
Effective indexing is crucial for providing convenient access to scanned versions of large collections of handwritten historical manuscripts. Since traditional handwritin...
Tomasz Adamek, Noel E. O'Connor, Alan F. Smeaton
SPIRE
2010
Springer
14 years 7 months ago
Dual-Sorted Inverted Lists
Several IR tasks rely, to achieve high efficiency, on a single pervasive data structure called the inverted index. This is a mapping from the terms in a text collection to the docu...
Gonzalo Navarro, Simon J. Puglisi