Web crawlers are increasingly used for focused tasks such as the extraction of data from Wikipedia or the analysis of social networks like last.fm. In these cases, pages are far m...
Franziska von dem Bussche, Klara A. Weiand, Benedi...
Web graphs are approximate snapshots of the web, created by search engines. Their creation is an error-prone procedure that relies on the availability of Internet nodes and the fa...
Panagiotis Papadimitriou, Ali Dasdan, Hector ...
In order to return relevant search results, a search engine must keep its local repository synchronized with the Web, but it is usually impossible to attain perfect freshness. Hence...
We consider a collaboration of peers autonomously crawling the Web. A pivotal issue when designing a peer-to-peer (P2P) Web search engine in this environment is query rou...
Sebastian Michel, Matthias Bender, Peter Triantafi...
Many applications aim to detect changes on the web in order to help users understand page updates and, more generally, web dynamics. Web archiv...