WWW
2001
ACM

Building a distributed full-text index for the Web

We identify crucial design issues in building a distributed inverted index for a large collection of web pages. We introduce a novel pipelining technique for structuring the core index-building system that substantially reduces the index construction time. We also propose a storage scheme for creating and managing inverted files using an embedded database system. We suggest and compare different strategies for collecting global statistics from distributed inverted indexes. Finally, we present performance results from experiments on a testbed distributed indexing system that we have implemented.
Added 22 Nov 2009
Updated 22 Nov 2009
Type Conference
Year 2001
Where WWW
Authors Sergey Melnik, Sriram Raghavan, Beverly Yang, Hector Garcia-Molina