Sciweavers

1109 search results - page 28 / 222
» Crawling on web graphs
Sort
View
JWSR
2007
172views more  JWSR 2007»
15 years 1 months ago
Service Class Driven Dynamic Data Source Discovery with DynaBot
: Dynamic Web data sources – sometimes known collectively as the Deep Web – increase the utility of the Web by providing intuitive access to data repositories anywhere that Web...
Daniel Rocco, James Caverlee, Ling Liu, Terence Cr...
VALUETOOLS
2006
ACM
166views Hardware» more  VALUETOOLS 2006»
15 years 7 months ago
Web graph analyzer tool
We present the software tool “Web Graph Analyzer”. This tool is designed to perform a comprehensive analysis of the Web Graph structure. By Web Graph we mean a graph whose ver...
Konstantin Avrachenkov, Danil Nemirovsky, Natalia ...
ICDE
2002
IEEE
161views Database» more  ICDE 2002»
16 years 3 months ago
Design and Implementation of a High-Performance Distributed Web Crawler
Broad web search engines as well as many more specialized search tools rely on web crawlers to acquire large collections of pages for indexing and analysis. Such a web crawler may...
Vladislav Shkapenyuk, Torsten Suel
WWW
2007
ACM
16 years 2 months ago
A large-scale study of robots.txt
Search engines largely rely on Web robots to collect information from the Web. Due to the unregulated open-access nature of the Web, robot activities are extremely diverse. Such c...
Yang Sun, Ziming Zhuang, C. Lee Giles
126
Voted
ICCS
2007
Springer
15 years 5 months ago
Estimating the Change of Web Pages
This paper presents the estimation methods computing the probabilities of how many times web pages are downloaded and modified, respectively, in the future crawls. The methods can ...
Sung Jin Kim, Sang Ho Lee