Search Sciweavers | Sciweavers

92

CIKM
2009
Springer

121views Information Technology» more CIKM 2009»

Graph-based seed selection for web-scale crawlers

15 years 9 months ago

One of the most important steps in web crawling is determining the starting points, or seed selection. This paper identiﬁes and explores the problem of seed selection in webscal...

Shuyi Zheng, Pavel Dmitriev, C. Lee Giles

claim paper

Read More »

106

click to vote

USS
2008

120views Operating System» more USS 2008»

There Is No Free Phish: An Analysis of "Free" and Live Phishing Kits

15 years 5 months ago

Download www.cs.ucsb.edu

Phishing is a form of identity theft in which an attacker attempts to elicit confidential information from unsuspecting victims. While in the past there has been significant work ...

Marco Cova, Christopher Kruegel, Giovanni Vigna

claim paper

Read More »

121

click to vote

ECIR
2006
Springer

134views Information Technology» more ECIR 2006»

Automatic Document Organization in a P2P Environment

15 years 4 months ago

Download ir.shef.ac.uk

Abstract. This paper describes an efficient method to construct reliable machine learning applications in peer-to-peer (P2P) networks by building ensemble based meta methods. We co...

Stefan Siersdorfer, Sergej Sizov

claim paper

Read More »

177

click to vote

SIGIR
2010
ACM

173views Information Technology» more SIGIR 2010»

The 8th workshop on large-scale distributed systems for information retrieval (LSDS-IR'10)

14 years 9 months ago

Download www.sigir.org

The size of the Web as well as user bases of search systems continue to grow exponentially. Consequently, providing subsecond query response times and high query throughput become...

Roi Blanco, Berkant Barla Cambazoglu, Claudio Lucc...

claim paper

Read More »

124

Voted

VLDB
2011
ACM

251views Database» more VLDB 2011»

Harvesting relational tables from lists on the web

14 years 10 months ago

Download www.vldb.org

A large number of web pages contain data structured in the form of “lists”. Many such lists can be further split into multi-column tables, which can then be used in more seman...

Hazem Elmeleegy, Jayant Madhavan, Alon Y. Halevy

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers