Search Sciweavers | Sciweavers

133

KDD
2006
ACM

185views Data Mining» more KDD 2006»

Understanding Content Reuse on the Web: Static and Dynamic Analyses

16 years 4 months ago

Abstract. In this paper we present static and dynamic studies of duplicate and near-duplicate documents in the Web. The static and dynamic studies involve the analysis of similar c...

Ricardo A. Baeza-Yates, Álvaro R. Pereira J...

claim paper

Read More »

171

click to vote

DAGSTUHL
2007

180views Software Engineering» more DAGSTUHL 2007»

Exploiting Community Behavior for Enhanced Link Analysis and Web Search

15 years 5 months ago

Download db.ucsd.edu

Methods for Web link analysis and authority ranking such as PageRank are based on the assumption that a user endorses a Web page when creating a hyperlink to this page. There is a...

Julia Luxenburger, Gerhard Weikum

claim paper

Read More »

120

click to vote

APIN
2005

107views more APIN 2005»

Multi-Instance Learning Based Web Mining

15 years 4 months ago

Download cs.nju.edu.cn

In multi-instance learning, the training set comprises labeled bags that are composed of unlabeled instances, and the task is to predict the labels of unseen bags. In this paper, ...

Zhi-Hua Zhou, Kai Jiang, Ming Li

claim paper

Read More »

163

click to vote

WEBDB
2001
Springer

137views Database» more WEBDB 2001»

Using Database Technology to Improve Performance of Web Proxy Servers

15 years 8 months ago

Download www.is.kyusan-u.ac.jp

In this paper, we propose to use database technology to improve performance of web proxy servers. We view the cache at a proxy server as a web warehouse with data organized in a h...

Kai Cheng, Yahiko Kambayashi, Mukesh K. Mohania

claim paper

Read More »

139

click to vote

UAI
2003

109views Artificial Intelligence» more UAI 2003»

Exploiting Locality in Searching the Web

15 years 5 months ago

Download www.cs.brown.edu

Published experiments on spidering the Web suggest that, given training data in the form of a (relatively small) subgraph of the Web containing a subset of a selected class of tar...

Joel Young, Thomas Dean

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers