Sciweavers

WWW
2004
ACM

Design of a crawler with bounded bandwidth

14 years 5 months ago
Design of a crawler with bounded bandwidth
This paper presents an algorithm to bound the bandwidth of a Web crawler. The crawler collects statistics on the transfer rate of each server to predict the expected bandwidth use for future downloads. The prediction allows us to activate the optimal number of fetcher threads in order to exploit the assigned bandwidth. The experimental results show the effectiveness of the proposed technique. Categories and Subject Descriptors: H.3 [Information Systems]: Information Storage and Retrieval General Terms: Design, Experimentation, Performance
Michelangelo Diligenti, Marco Maggini, Filippo Mar
Added 22 Nov 2009
Updated 22 Nov 2009
Type Conference
Year 2004
Where WWW
Authors Michelangelo Diligenti, Marco Maggini, Filippo Maria Pucci, Franco Scarselli
Comments (0)