Sciweavers

11 search results - page 1 / 3
KDD
2006
ACM
14 years 4 months ago
Understanding Content Reuse on the Web: Static and Dynamic Analyses
Abstract. In this paper we present static and dynamic studies of duplicate and near-duplicate documents in the Web. The static and dynamic studies involve the analysis of similar c...
Ricardo A. Baeza-Yates, Álvaro R. Pereira J...
WSDM
2009
ACM
13 years 11 months ago
The web changes everything: understanding the dynamics of web content
The Web is a dynamic, ever changing collection of information. This paper explores changes in Web content by analyzing a crawl of 55,000 Web pages, selected to represent different...
Eytan Adar, Jaime Teevan, Susan T. Dumais, Jonatha...
AIMSA
2004
Springer
13 years 10 months ago
Towards a Better Understanding of the Language Content in the Semantic Web
Internet content today is about 80% text-based. No matter static or dynamic, the information is encoded and presented as multilingual, unstructured natural language text pages. As ...
Pavlin Dobrev, Albena Strupchanska, Galia Angelova
WSE
2003
IEEE
13 years 9 months ago
Resolution of Static Clones in Dynamic Web Pages
Cloning is extremely likely to occur in web sites, much more so than in other software. While some clones exist for valid reasons, or are too small to eliminate, cloning percentag...
Nikita Synytskyy, James R. Cordy, Thomas R. Dean
CN
1999
13 years 4 months ago
Towards a Better Understanding of Web Resources and Server Responses for Improved Caching
This work focuses on characterizing information about Web resources and server responses that is relevant to Web caching. The approach is to study a set of URLs at a variety of si...
Craig E. Wills, Mikhail Mikhailov