Sciweavers

ADC
2003
Springer
153views Database» more  ADC 2003»
13 years 10 months ago
Automated Discovery of Search Interfaces on the Web
Web search engines work well for finding crawlable pages, but not for finding datasets hidden behind Web search forms. We describe a novel technique for detecting search forms, ...
Jared Cope, Nick Craswell, David Hawking
WWW
2001
ACM
14 years 5 months ago
Crawling the Hidden Web
Current-day crawlers retrieve content only from the publicly indexable Web, i.e., the set of Web pages reachable purely by following hypertext links, ignoring search forms and pag...
Sriram Raghavan, Hector Garcia-Molina