Search Sciweavers | Sciweavers

96 search results - page 1 / 20

» Detecting Near-replicas on the Web by Content and Hyperlink ...

click to vote

WWW
2003
ACM

139views Internet Technology» more WWW 2003»

Detecting Near-replicas on the Web by Content and Hyperlink Analysis

14 years 5 months ago

Download nautilus.dii.unisi.it

The presence of replicas or near-replicas of documents is very common on the Web. Documents may be replicated completely or partially for different reasons (versions, mirrors, etc...

Ernesto Di Iorio, Michelangelo Diligenti, Marco Go...

claim paper

Read More »

click to vote

AIRWEB
2008
Springer

126views Internet Technology» more AIRWEB 2008»

Web spam identification through content and hyperlinks

13 years 6 months ago

Download airweb.cse.lehigh.edu

We present an algorithm, witch, that learns to detect spam hosts or pages on the Web. Unlike most other approaches, it simultaneously exploits the structure of the Web graph as we...

Jacob Abernethy, Olivier Chapelle, Carlos Castillo

claim paper

Read More »

click to vote

SIGKDD
2008

248views more SIGKDD 2008»

Web data mining: exploring hyperlinks, contents, and usage data

13 years 4 months ago

Download www.sigkdd.org

This paper presents a review of the book "Web Data Mining - Exploring Hyperlinks, Contents, and Usage Data" by Bing Liu. The review concludes that the breadth and depth ...

Olfa Nasraoui

claim paper

Read More »

click to vote

HT
2003
ACM

131views Internet Technology» more HT 2003»

Enhanced web document summarization using hyperlinks

13 years 10 months ago

Download www.mariapinto.es

This paper addresses the issue of Web document summarization. As textual content of Web documents is often scarce or irrelevant and existing summarization techniques are based on ...

Jean-Yves Delort, Bernadette Bouchon-Meunier, Mari...

claim paper

Read More »

click to vote

SIGIR
1998
ACM

117views Information Technology» more SIGIR 1998»

Improved Algorithms for Topic Distillation in a Hyperlinked Environment

13 years 9 months ago

Download www.isgroup.unimo.it

This paper addresses the problem of topic distillation on the World Wide Web, namely, given a typical user query to ﬁnd quality documents related to the query topic. Connectivity...

Krishna Bharat, Monika Rauch Henzinger

claim paper

Read More »

« Prev « First page 1 / 20 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers