Sciweavers

84 search results - page 1 / 17
» Managing duplicates in a web archive
SAC
2006
ACM
15 years 6 months ago
Managing duplicates in a web archive
Daniel Gomes, André L. Santos, Mário...
TMM
2010
14 years 7 months ago
On the Annotation of Web Videos by Efficient Near-Duplicate Search
Wanlei Zhao, Xiao Wu, Chong-Wah Ngo
SIGIR
2008
ACM
15 years 19 days ago
SpotSigs: robust and efficient near duplicate detection in large web collections
Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...
Martin Theobald, Jonathan Siddharth, Andreas Paepc...
WWW
2006
ACM
16 years 1 month ago
Archiving web site resources: a records management view
In this paper, we propose the use of records management principles to identify and manage Web site resources with enduring value as records. Current Web archiving activities, coll...
Maureen Pennock, Brian Kelly
ELPUB
2007
ACM
15 years 4 months ago
Digitisation and Access to Archival Collections: A Case Study of the Sofia Municipal Government (1878-1879)
The paper presents in brief a project aimed at the development of a methodology and corresponding software tools intended for building of proper environments giving up means for s...
Maria Nisheva-Pavlova, Pavel Pavlov, Nikolay Marko...