In this paper we study the order in which a crawler should visit the URLs it has seen, so as to obtain more "important" pages first. Obtaining important pages rapidly can ...
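The ordering problem described above can be illustrated with a priority-queue crawl frontier that always visits the highest-importance known URL next. This is a minimal sketch, not the paper's actual algorithm; the numeric scores are stand-ins for whatever importance metric (e.g., backlink count or PageRank estimate) a real crawler would use.

```python
import heapq

def crawl_order(url_scores):
    """Return URLs in the order an importance-first crawler would visit them.

    url_scores: dict mapping URL -> estimated importance (higher = better).
    heapq is a min-heap, so scores are negated to pop the largest first.
    """
    frontier = [(-score, url) for url, score in url_scores.items()]
    heapq.heapify(frontier)
    order = []
    while frontier:
        _neg_score, url = heapq.heappop(frontier)
        order.append(url)
    return order

# Example: the highest-scored URL is crawled first.
print(crawl_order({"/a": 0.1, "/b": 0.9, "/c": 0.5}))  # ['/b', '/c', '/a']
```

In a real crawler the frontier would be updated as new links (and better importance estimates) are discovered, rather than fixed up front.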
We have developed a web-repository crawler that is used for reconstructing websites when backups are unavailable. Our crawler retrieves web resources from the Internet Archive, Go...
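When reconstructing a site from several web repositories, one recurring step is choosing, for each resource, the best snapshot among the candidates the repositories return. The sketch below is an illustrative assumption, not the described crawler's implementation: it simply keeps the most recent snapshot per URL, and the repository labels are hypothetical.

```python
def pick_snapshots(candidates):
    """For each URL, keep the most recently archived candidate snapshot.

    candidates: dict URL -> list of (repository_name, timestamp) pairs,
    with timestamps in a lexicographically sortable form like 'YYYY-MM'.
    Repository names here are illustrative placeholders.
    """
    return {
        url: max(snaps, key=lambda snap: snap[1])  # latest timestamp wins
        for url, snaps in candidates.items()
        if snaps  # skip URLs with no recovered copies
    }

# Example: the 2005 copy is preferred over the 2004 one.
chosen = pick_snapshots({
    "/index.html": [("internet_archive", "2004-01"), ("search_cache", "2005-03")],
})
print(chosen)  # {'/index.html': ('search_cache', '2005-03')}
```

A production policy might instead prefer a particular repository, or the snapshot closest to the date the site was lost.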
A typical web search engine consists of three principal parts: crawling engine, indexing engine, and searching engine. The present work aims to optimize the performance of the cra...
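The three-part architecture named above can be sketched end to end with toy versions of the indexing and searching stages (the crawling stage is assumed to have already fetched the pages). This is a generic inverted-index sketch, not the present work's system.

```python
def build_index(pages):
    """Indexing stage: map each term to the set of page ids containing it."""
    index = {}
    for page_id, text in pages.items():
        for term in set(text.lower().split()):
            index.setdefault(term, set()).add(page_id)
    return index

def search(index, query):
    """Searching stage: return ids of pages containing every query term."""
    term_sets = [index.get(term.lower(), set()) for term in query.split()]
    return set.intersection(*term_sets) if term_sets else set()

# Example over two tiny "crawled" pages.
pages = {1: "web crawler design", 2: "search engine index design"}
index = build_index(pages)
print(search(index, "design"))   # {1, 2}
print(search(index, "crawler"))  # {1}
```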
Konstantin Avrachenkov, Alexander N. Dudin, Valent...
We measure the WT10g test collection, used in the TREC-9 and TREC 2001 Web Tracks, and the .GOV test collection used in the TREC 2002 Web and Interactive Tracks, with common measu...
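One family of "common measures" applied to such test collections is link connectivity. As a hedged illustration only (the specific measures used for WT10g and .GOV are not reproduced here), the sketch below computes the average out-degree and the number of dangling pages of a small link graph.

```python
def connectivity_stats(links):
    """Compute simple link-structure measures of a collection.

    links: dict page -> list of pages it links to.
    Returns (average out-degree, count of dangling pages with no outlinks).
    """
    out_degrees = [len(targets) for targets in links.values()]
    avg_out_degree = sum(out_degrees) / len(out_degrees)
    dangling = sum(1 for degree in out_degrees if degree == 0)
    return avg_out_degree, dangling

# Example: three pages, one of which links nowhere.
print(connectivity_stats({"a": ["b", "c"], "b": ["a"], "c": []}))  # (1.0, 1)
```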
The number of vertical search engines and portals has increased rapidly in recent years, making the importance of topic-driven (focused) crawlers evident. In this paper, we de...
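The core decision in a focused crawler is whether a fetched page is on-topic enough to justify following its links. A minimal sketch of that gate, assuming a crude term-overlap score rather than the classifier any particular system would use:

```python
def topic_relevance(page_text, topic_terms):
    """Fraction of topic terms that appear in the page text (crude score in [0, 1])."""
    words = set(page_text.lower().split())
    hits = sum(1 for term in topic_terms if term.lower() in words)
    return hits / len(topic_terms)

def should_enqueue(page_text, topic_terms, threshold=0.5):
    """Follow a page's outlinks only if it looks sufficiently on-topic."""
    return topic_relevance(page_text, topic_terms) >= threshold

# Example: an on-topic page passes, an off-topic one is pruned.
print(should_enqueue("a web crawler feeds the search index", ["crawler", "search"]))  # True
print(should_enqueue("cooking recipes and travel tips", ["crawler", "search"]))       # False
```

Real focused crawlers typically replace this score with a trained text classifier and may also propagate relevance estimates to unvisited links.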