Search Sciweavers | Sciweavers

NSDI
2010

The Architecture and Implementation of an Extensible Web Crawler

15 years 3 months ago

Many Web services operate their own Web crawlers to discover data of interest, despite the fact that large-scale, timely crawling is complex, operationally intensive, and expensive...

Jonathan M. Hsieh, Steven D. Gribble, Henry M. Lev...

WWW
2009
ACM

Data quality in web archiving

16 years 2 months ago

Download www.dl.kuis.kyoto-u.ac.jp

Web archives preserve the history of Web sites and have high long-term value for media and business analysts. Such archives are maintained by periodically re-crawling entire Web s...

Marc Spaniol, Dimitar Denev, Arturas Mazeika, Gerh...

ESWS
2008
Springer

Semantic Sitemaps: Efficient and Flexible Access to Datasets on the Semantic Web

15 years 3 months ago

Download www.eswc2008.org

Increasing amounts of RDF data are available on the Web for consumption by Semantic Web browsers and indexing by Semantic Web search engines. Current Semantic Web publishing practi...

Richard Cyganiak, Holger Stenzhorn, Renaud Delbru,...

DEBU
2002

The Role of Web Services in Information Search

15 years 1 months ago

Download www.mpi-inf.mpg.de

State-of-the-art Web search engines are inherently limited in their abilities to search information in Deep Web beyond portals. This paper discusses how Web services and Semantic-...

Jens Graupmann, Gerhard Weikum

WWW
2007
ACM

GigaHash: scalable minimal perfect hashing for billions of urls

16 years 2 months ago

Download www2007.org

A minimal perfect hash function maps a static set of n keys onto the range of integers {0, 1, 2, ..., n - 1}. We present a scalable high performance algorithm based on random graphs for ...

Kumar Chellapilla, Anton Mityagin, Denis Xavier Ch...
