Sciweavers

84 search results - page 5 / 17
» Managing duplicates in a web archive
Sort
View
JODL
2006
136views more  JODL 2006»
14 years 10 months ago
MIT's CWSpace project: packaging metadata for archiving educational content in DSpace
This paper describes work in progress on the research project CWSpace, sponsored by the MIT and Microsoft Research iCampus program, to investigate the metadata standards and protoc...
William Reilly, Robert Wolfe, MacKenzie Smith
JCDL
2006
ACM
128views Education» more  JCDL 2006»
15 years 4 months ago
Building a research library for the history of the web
This paper describes the building of a research library for studying the Web, especially research on how the structure and content of the Web change over time. The library is part...
William Y. Arms, Selcuk Aya, Pavel Dmitriev, Blaze...
DEXAW
2008
IEEE
161views Database» more  DEXAW 2008»
15 years 4 months ago
Model-Based QoS-Enabled Self-Healing Web Services
Failures during web service execution may depend on a wide variety of causes, such as network faults, server crashes, or application-related errors, such as unavailability of a re...
Olga Nabuco, Riadh Ben Halima, Khalil Drira, Maria...
WWW
2004
ACM
15 years 10 months ago
Web data integration using approximate string join
Web data integration is an important preprocessing step for web mining. It is highly likely that several records on the web whose textual representations differ may represent the ...
Yingping Huang, Gregory R. Madey
SIGUCCS
2004
ACM
15 years 3 months ago
Software's little helpers: managing your lab areas
There are always more labs and other things to attend to than available bodies to watch over said pesky details. How can we keep an eye on the ever-present large and small events ...
Doug Simpson