Sciweavers

EDBTW
2010
Springer
13 years 3 months ago
Using visual pages analysis for optimizing web archiving
Due to the growing importance of the World Wide Web, archiving it has become crucial for preserving useful source of information. To maintain a web archive up-to-date, crawlers ha...
Myriam Ben Saad, Stéphane Gançarski
DEXA
2010
Springer
226views Database» more  DEXA 2010»
13 years 3 months ago
Vi-DIFF: Understanding Web Pages Changes
Nowadays, many applications are interested in detecting and discovering changes on the web to help users to understand page updates and more generally, the web dynamics. Web archiv...
Zeynep Pehlivan, Myriam Ben Saad, Stéphane ...
PVLDB
2008
137views more  PVLDB 2008»
13 years 4 months ago
Flashing up the storage layer
In the near future, commodity hardware is expected to incorporate both flash and magnetic disks. In this paper we study how the storage layer of a database system can benefit from...
Ioannis Koltsidas, Stratis Viglas
IVS
2008
93views more  IVS 2008»
13 years 4 months ago
Visual analysis of controversy in user-generated encyclopedias
Wikipedia is a large and rapidly growing Web-based collaborative authoring environment, where anyone on the Internet can create, modify, and delete pages about encyclopedic topics...
Ulrik Brandes, Jürgen Lerner
SAC
2002
ACM
13 years 4 months ago
Dynamically generating web application fragments from page templates
Web-based applications are typically required to be highly customizable and configurable. New application requirements have to be introduced rapidly, often without stopping the ru...
Uwe Zdun
CSUR
1999
159views more  CSUR 1999»
13 years 4 months ago
Hubs, authorities, and communities
The Web can be naturally modeled as a directed graph, consisting of a set of abstract nodes (the pages) joined by directional edges (the hyperlinks). Hyperlinks encode a considerab...
Jon M. Kleinberg
INTR
2002
50views more  INTR 2002»
13 years 4 months ago
Methodologies for crawler based Web surveys
There have been many attempts to study the content of the web, either through human or automatic agents. Five different previously used web survey methodologies are described and ...
Mike Thelwall
JODL
2000
137views more  JODL 2000»
13 years 4 months ago
Automatic page analysis for the creation of a digital library from newspaper archives
Digital preservation of newspaper archives aims both at the salvation of endangered material (paper) and at the creation of digital library services that will allow full utilizatio...
Basilios Gatos, S. L. Mantzaris, Stavros J. Perant...
DEBU
2002
135views more  DEBU 2002»
13 years 4 months ago
Analyzing Fine-grained Hypertext Features for Enhanced Crawling and Topic Distillation
Early Web search engines closely resembled Information Retrieval (IR) systems which had matured over several decades. Around 1996
Soumen Chakrabarti, Ravindra Jaju
CACM
2000
120views more  CACM 2000»
13 years 4 months ago
Universal Usability
ost abstract sense, we build web pages so that computers can read them. The software that people use to access web pages is what "reads" the document. How the page is ren...
Ben Shneiderman