Sciweavers

» How-To Web Pages
KDD 2006, ACM
16 years 4 months ago
Understanding Content Reuse on the Web: Static and Dynamic Analyses
Abstract. In this paper we present static and dynamic studies of duplicate and near-duplicate documents in the Web. The static and dynamic studies involve the analysis of similar c...
Ricardo A. Baeza-Yates, Álvaro R. Pereira J...
DAGSTUHL 2007
15 years 5 months ago
Exploiting Community Behavior for Enhanced Link Analysis and Web Search
Methods for Web link analysis and authority ranking such as PageRank are based on the assumption that a user endorses a Web page when creating a hyperlink to this page. There is a...
Julia Luxenburger, Gerhard Weikum
APIN 2005
15 years 4 months ago
Multi-Instance Learning Based Web Mining
In multi-instance learning, the training set comprises labeled bags that are composed of unlabeled instances, and the task is to predict the labels of unseen bags. In this paper, ...
Zhi-Hua Zhou, Kai Jiang, Ming Li
WEBDB 2001, Springer
15 years 8 months ago
Using Database Technology to Improve Performance of Web Proxy Servers
In this paper, we propose to use database technology to improve performance of web proxy servers. We view the cache at a proxy server as a web warehouse with data organized in a h...
Kai Cheng, Yahiko Kambayashi, Mukesh K. Mohania
UAI 2003
15 years 5 months ago
Exploiting Locality in Searching the Web
Published experiments on spidering the Web suggest that, given training data in the form of a (relatively small) subgraph of the Web containing a subset of a selected class of tar...
Joel Young, Thomas Dean