Sciweavers

ICDE
2005
IEEE

Modeling and Managing Content Changes in Text Databases

14 years 5 months ago
Modeling and Managing Content Changes in Text Databases
Large amounts of (often valuable) information are stored in web-accessible text databases. "Metasearchers" provide unified interfaces to query multiple such databases at once. For efficiency, metasearchers rely on succinct statistical summaries of the database contents to select the best databases for each query. So far, database selection research has largely assumed that databases are static, so the associated statistical summaries do not need to change over time. However, databases are rarely static and the statistical summaries that describe their contents need to be updated periodically to reflect content changes. In this paper, we first report the results of a study showing how the content summaries of 152 real web databases evolved over a period of 52 weeks. Then, we show how to use "survival analysis" techniques in general, and Cox's proportional hazards regression in particular, to model database changes over time and predict when we should update eac...
Panagiotis G. Ipeirotis, Alexandros Ntoulas, Jungh
Added 01 Nov 2009
Updated 01 Nov 2009
Type Conference
Year 2005
Where ICDE
Authors Panagiotis G. Ipeirotis, Alexandros Ntoulas, Junghoo Cho, Luis Gravano
Comments (0)