Search Sciweavers | Sciweavers

WWW
2001
ACM

150 views, Internet Technology

Effective Web data extraction with standard XML technologies

16 years 8 months ago

We discuss the problem of Web data extraction and describe an XML-based methodology whose goal extends far beyond simple "screen scraping." An ideal data extraction proc...

Jussi Myllymaki


WWW
2011
ACM

258 views, Internet Technology

Prophiler: a fast filter for the large-scale detection of malicious web pages

15 years 2 months ago

Download www.iseclab.org

Malicious web pages that host drive-by-download exploits have become a popular means for compromising hosts on the Internet and, subsequently, for creating large-scale botnets. In...

Davide Canali, Marco Cova, Giovanni Vigna, Christo...


ECIR
2006
Springer

134 views, Information Technology

Automatic Document Organization in a P2P Environment

15 years 9 months ago

Download ir.shef.ac.uk

Abstract. This paper describes an efficient method to construct reliable machine learning applications in peer-to-peer (P2P) networks by building ensemble based meta methods. We co...

Stefan Siersdorfer, Sergej Sizov


SIGIR
2006
ACM

178 views, Information Technology

AggregateRank: bringing order to web sites

16 years 1 month ago

Download research.microsoft.com

Since the website is one of the most important organizational structures of the Web, how to effectively rank websites has been essential to many Web applications, such as Web sear...

Guang Feng, Tie-Yan Liu, Ying Wang, Ying Bao, Zhim...


ICIP
2000
IEEE

141 views, Image Processing

Efficient Video Similarity Measurement and Search

16 years 9 months ago

Download www.vis.uky.edu

We consider the use of meta-data and/or video-domain methods to detect similar videos on the web. Meta-data is extracted from the textual and hyperlink information associated with...

Sen-Ching S. Cheung, Avideh Zakhor

