Search Sciweavers | Sciweavers

157

Voted

PVLDB
2010

161views more PVLDB 2010»

Annotating and Searching Web Tables Using Entities, Types and Relationships

15 years 1 months ago

Tables are a universal idiom to present relational data. Billions of tables on Web pages express entity references, attributes and relationships. This representation of relational...

Girija Limaye, Sunita Sarawagi, Soumen Chakrabarti

claim paper

Read More »

150

click to vote

WWW
2011
ACM

258views Internet Technology» more WWW 2011»

Prophiler: a fast filter for the large-scale detection of malicious web pages

14 years 10 months ago

Download www.iseclab.org

Malicious web pages that host drive-by-download exploits have become a popular means for compromising hosts on the Internet and, subsequently, for creating large-scale botnets. In...

Davide Canali, Marco Cova, Giovanni Vigna, Christo...

claim paper

Read More »

135

Voted

CIDR
2009

129views Algorithms» more CIDR 2009»

Extracting and Querying a Comprehensive Web Database

15 years 4 months ago

Download turing.cs.washington.edu

Recent research in domain-independent information extraction holds the promise of an automatically-constructed structured database derived from the Web. A query system based on th...

Michael J. Cafarella

claim paper

Read More »

162

Voted

SIGIR
2008
ACM

176views Information Technology» more SIGIR 2008»

SpotSigs: robust and efficient near duplicate detection in large web collections

15 years 3 months ago

Download ilpubs.stanford.edu

Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...

Martin Theobald, Jonathan Siddharth, Andreas Paepc...

claim paper

Read More »

123

Voted

WWW
2004
ACM

179views Internet Technology» more WWW 2004»

Combining link and content analysis to estimate semantic similarity

16 years 4 months ago

Download www.informatics.indiana.edu

Search engines use content and link information to crawl, index, retrieve, and rank Web pages. The correlations between similarity measures based on these cues and on semantic ass...

Filippo Menczer

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers