Sciweavers

84 search results - page 8 / 17
» Managing duplicates in a web archive
WWW
2007
ACM
16 years 18 days ago
Designing efficient sampling techniques to detect webpage updates
Due to resource constraints, Web archiving systems and search engines usually have difficulties keeping the entire local repository synchronized with the Web. We advance the state...
Qingzhao Tan, Ziming Zhuang, Prasenjit Mitra, C. L...
AUSDM
2006
Springer
15 years 3 months ago
Tracking the Changes of Dynamic Web Pages in the Existence of URL Rewriting
Crawlers in a knowledge management system need to collect and archive documents from websites, and also track the change status of these documents. However, the existence of URL r...
Ping-Jer Yeh, Jie-Tsung Li, Shyan-Ming Yuan
ELPUB
2004
ACM
15 years 5 months ago
What academic libraries need from e-publishers
tions, allowing interlinking of abstracting and indexing databases with full-text sources, and providing the ability to search across multiple databases simultaneously. Publishers ...
Claire Dygert
IPPS
2002
IEEE
15 years 4 months ago
FARM: A Feedback-Based Adaptive Resource Management for Autonomous Hot-Spot Convergence System
In this paper, we present a novel and comprehensive resource management solution for the autonomous hot-spot convergence system (AHSCS) that uses sensor web. This solut...
S. Swaminathan, G. Manimaran
SIGIR
2010
ACM
15 years 3 months ago
Freshness matters: in flowers, food, and web authority
The collective contributions of billions of users across the globe each day result in an ever-changing web. In verticals like news and real-time search, recency is an obvious sign...
Na Dai, Brian D. Davison