Sciweavers

84 search results - page 4 / 17
» Managing duplicates in a web archive
CORR
2010
Springer
14 years 10 months ago
An HTTP-Based Versioning Mechanism for Linked Data
Dereferencing a URI returns a representation of the current state of the resource identified by that URI. But on the Web, representations of prior states of a resource are also av...
Herbert Van de Sompel, Robert Sanderson, Michael L...
PVLDB
2008
14 years 9 months ago
Large-scale collaborative analysis and extraction of web data
Archived web data is a great resource for scientific research, but poses serious challenges in data processing and management. We demonstrate the Web Lab Collaboration Server, a p...
Felix Weigel, Biswanath Panda, Mirek Riedewald, Jo...
WWW
2003
ACM
15 years 10 months ago
Web-R: a Tool to Record & Replay Personal Web Navigation
This poster presents a useful tool to capture the content of browsing sessions. Web-R systematically saves all the components necessary and sufficient to view offline the pag...
Jean-Daniel Kant, Alain Lifchitz
SEMWEB
2007
Springer
15 years 4 months ago
Doris: Managing Document-based Knowledge in Large Organisations via Semantic Web Technologies
The acquisition, sharing and reuse of knowledge is a prime challenge in large organisations. Doris is a framework for defining Knowledge Management applications based on Semantic W...
Ravish Bhagdev, Jonathan Butters, Ajay Chakravarth...
WWW
2008
ACM
15 years 10 months ago
iRobot: an intelligent crawler for web forums
In this paper we study the Web forum crawling problem, a fundamental step in many Web applications such as search engines and Web data mining. As a typical user-crea...
Rui Cai, Jiang-Ming Yang, Wei Lai, Yida Wang, Lei ...