Sciweavers

299 search results - page 2 / 60
» User-centric Web crawling
Sort
View
DEBU
2002
135views more  DEBU 2002»
13 years 4 months ago
Analyzing Fine-grained Hypertext Features for Enhanced Crawling and Topic Distillation
Early Web search engines closely resembled Information Retrieval (IR) systems which had matured over several decades. Around 1996
Soumen Chakrabarti, Ravindra Jaju
WWW
2007
ACM
14 years 5 months ago
Random web crawls
This paper proposes a random Web crawl model. A Web crawl is a (biased and partial) image of the Web. This paper deals with the hyperlink structure, i.e. a Web crawl is a graph, w...
Toufik Bennouas, Fabien de Montgolfier
PVLDB
2008
124views more  PVLDB 2008»
13 years 4 months ago
Google's Deep Web crawl
The Deep Web, i.e., content hidden behind HTML forms, has long been acknowledged as a significant gap in search engine coverage. Since it represents a large portion of the structu...
Jayant Madhavan, David Ko, Lucja Kot, Vignesh Gana...
JUCS
2010
135views more  JUCS 2010»
13 years 3 months ago
Locating and Crawling eGovernment Services A Light-weight Semantic Approach
Abstract: The application of Web 2.0 tools and methodologies in the domain of eGovernment is not yet a fully exploited area due to the immaturity of the software support, and the l...
Luis Álvarez Sabucedo, Luis E. Anido-Rif&oa...
PVLDB
2008
173views more  PVLDB 2008»
13 years 4 months ago
AJAXSearch: crawling, indexing and searching web 2.0 applications
Cristian Duda, Gianni Frey, Donald Kossmann, Chong...