Sciweavers

» Compressed web indexes
ICDAR
1999
IEEE
15 years 2 months ago
DjVu: Analyzing and Compressing Scanned Documents for Internet Distribution
DjVu is an image compression technique specifically geared towards the compression of scanned documents in color at high resolution. Typical magazine pages in color scanned at 300...
Patrick Haffner, Léon Bottou, Paul G. Howar...
DAS
1998
Springer
15 years 2 months ago
Group 4 Compressed Document Matching
Numerous approaches, including textual, structural and featural, to detecting duplicate documents have been investigated. Considering document images are usually stored and transm...
Dar-Shyang Lee, Jonathan J. Hull
JUCS
2011
14 years 4 months ago
Nabuco - Two Decades of Document Processing in Latin America
This paper reports on the Joaquim Nabuco Project, a pioneering work in Latin America on document digitalization, enhancement, compression, indexing, retrieval and network transmi...
Rafael Dueire Lins
ICDE
2007
IEEE
15 years 11 months ago
DSphere: A Source-Centric Approach to Crawling, Indexing and Searching the World Wide Web
We describe DSPHERE, a decentralized system for crawling, indexing, searching and ranking of documents in the World Wide Web. Unlike most of the existing search technologies tha...
Bhuvan Bamba, Ling Liu, James Caverlee, Vaibhav Pa...
SIGIR
2011
ACM
14 years 17 days ago
Inverted indexes for phrases and strings
Inverted indexes are the most fundamental and widely used data structures in information retrieval. For each unique word occurring in a document collection, the inverted index sto...
Manish Patil, Sharma V. Thankachan, Rahul Shah, Wi...