Search Sciweavers | Sciweavers

125 search results - page 2 / 25

» Minimizing the Network Distance in Distributed Web Crawling

148

click to vote

SIGIR
2003
ACM

159views Information Technology» more SIGIR 2003»

Apoidea: A Decentralized Peer-to-Peer Architecture for Crawling the World Wide Web

15 years 9 months ago

Download www.aameeksingh.com

This paper describes a decentralized peer-to-peer model for building a Web crawler. Most of the current systems use a centralized client-server model, in which the crawl is done by...

Aameek Singh, Mudhakar Srivatsa, Ling Liu, Todd Mi...

claim paper

Read More »

205

click to vote

CORR
2012
Springer

292views Education» more CORR 2012»

Optimal Threshold Control by the Robots of Web Search Engines with Obsolescence of Documents

13 years 11 months ago

Download www-sop.inria.fr

A typical web search engine consists of three principal parts: crawling engine, indexing engine, and searching engine. The present work aims to optimize the performance of the cra...

Konstantin Avrachenkov, Alexander N. Dudin, Valent...

claim paper

Read More »

242

Voted

ICDE
2002
IEEE

161views Database» more ICDE 2002»

Design and Implementation of a High-Performance Distributed Web Crawler

16 years 5 months ago

Download cis.poly.edu

Broad web search engines as well as many more specialized search tools rely on web crawlers to acquire large collections of pages for indexing and analysis. Such a web crawler may...

Vladislav Shkapenyuk, Torsten Suel

claim paper

Read More »

154

click to vote

DEXAW
2010
IEEE

181views Database» more DEXAW 2010»

Towards a Search System for the Web Exploiting Spatial Data of a Web Document

15 years 5 months ago

Download laclavik.net

In this paper, we describe our work in progress in the scope of information retrieval exploiting the spatial data extracted from web documents. We discuss problems of a search for ...

Stefan Dlugolinsky, Michal Laclavik, Ladislav Hluc...

claim paper

Read More »

146

click to vote

CIKM
2005
Springer

143views Information Technology» more CIKM 2005»

Focused crawling for both topical relevance and quality of medical information

15 years 9 months ago

Download research.microsoft.com

Subject-speciﬁc search facilities on health sites are usually built using manual inclusion and exclusion rules. These can be expensive to maintain and often provide incomplete c...

Thanh Tin Tang, David Hawking, Nick Craswell, Kath...

claim paper

Read More »

« Prev « First page 2 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers