Sciweavers

472 search results - page 1 / 95
» Crawling the Hidden Web
Sort
View
IADIS
2004
13 years 6 months ago
Crawling the client-side hidden web
There is a great amount of information on the web that can not be accessed by conventional crawler engines. This portion of the web is usually called hidden web data. To be able t...
Manuel Álvarez, Alberto Pan, Juan Raposo, &...
JUCS
2008
124views more  JUCS 2008»
13 years 4 months ago
Structure-Based Crawling in the Hidden Web
: The number of applications that need to crawl the Web to gather data is growing at an ever increasing pace. In some cases, the criterion to determine what pages must be included ...
Márcio L. A. Vidal, Altigran Soares da Silv...
WWW
2001
ACM
14 years 5 months ago
Crawling the Hidden Web
Current-day crawlers retrieve content only from the publicly indexable Web, i.e., the set of Web pages reachable purely by following hypertext links, ignoring search forms and pag...
Sriram Raghavan, Hector Garcia-Molina
ICCSA
2007
Springer
13 years 11 months ago
Crawling the Content Hidden Behind Web Forms
The crawler engines of today cannot reach most of the information contained in the Web. A great amount of valuable information is “hidden” behind the query forms of online data...
Manuel Álvarez, Juan Raposo, Alberto Pan, F...
WIDM
2004
ACM
13 years 10 months ago
Probabilistic models for focused web crawling
A Focused crawler must use information gleaned from previously crawled page sequences to estimate the relevance of a newly seen URL. Therefore, good performance depends on powerfu...
Hongyu Liu, Evangelos E. Milios, Jeannette Janssen