Sciweavers

» Distributed community crawling
ADMA
2009
Springer
Crawling Deep Web Using a New Set Covering Algorithm
Crawling the deep web often requires the selection of an appropriate set of queries so that they can cover most of the documents in the data source with low cost. This ca...
Yan Wang, Jianguo Lu, Jessica Chen
CN
2000
Graph structure in the Web
The study of the web as a graph is not only fascinating in its own right, but also yields valuable insight into web algorithms for crawling, searching and community discovery, and...
Andrei Z. Broder, Ravi Kumar, Farzin Maghoul, Prab...
JASIS
2008
Metadata harvesting for content-based distributed information retrieval
We propose an approach to content-based Distributed Information Retrieval based on the periodic and incremental centralisation of full-content indices of widely dispersed and auto...
Fabio Simeoni, Murat Yakici, Steve Neely, Fabio Cr...
PVLDB
2008
Google's Deep Web crawl
The Deep Web, i.e., content hidden behind HTML forms, has long been acknowledged as a significant gap in search engine coverage. Since it represents a large portion of the structu...
Jayant Madhavan, David Ko, Lucja Kot, Vignesh Gana...
SIGIR
2003
ACM
Apoidea: A Decentralized Peer-to-Peer Architecture for Crawling the World Wide Web
This paper describes a decentralized peer-to-peer model for building a Web crawler. Most of the current systems use a centralized client-server model, in which the crawl is done by...
Aameek Singh, Mudhakar Srivatsa, Ling Liu, Todd Mi...