Indexing Incomplete Databases

10 years 11 months ago
Indexing Incomplete Databases
Incomplete databases, that is, databases that are missing data, are present in many research domains. It is important to derive techniques to access these databases efficiently. We first show that known indexing techniques for multi-dimensional data search break down in terms of performance when indexed attributes contain missing data. This paper utilizes two popularly employed indexing techniques, bitmaps and quantization, to correctly and efficiently answer queries in the presence of missing data. Query execution and interval evaluation are formalized for the indexing structures based on whether missing data is considered to be a query match or not. The performance of Bitmap indexes and quantization based indexes is evaluated and compared over a variety of analysis parameters for real and synthetic data sets. Insights into the conditions for which to use each technique are provided.
Guadalupe Canahuate, Michael Gibas, Hakan Ferhatos
Added 08 Dec 2009
Updated 08 Dec 2009
Type Conference
Year 2006
Where EDBT
Authors Guadalupe Canahuate, Michael Gibas, Hakan Ferhatosmanoglu
Comments (0)