Sciweavers

EDBT
1998
ACM

Improving the Query Performance of High-Dimensional Index Structures by Bulk-Load Operations

13 years 7 months ago
Improving the Query Performance of High-Dimensional Index Structures by Bulk-Load Operations
Abstract. In this paper, we propose a new bulk-loading technique for high-dimensional indexes which represent an important component of multimedia database systems. Since it is very inefficient to construct an index for a large amount of data by dynamic insertion of single objects, there is an increasing interest in bulk-loading techniques. In contrast to previous approaches, our technique exploits a priori knowledge of the complete data set to improve both construction time and query performance. Our algorithm operates in a mannar similar to the Quicksort algorithm and has an average runtime complexity of O(n log n). We additionally improve the query performance by optimizing the shape of the bounding boxes, by completely avoiding overlap, and by clustering the pages on disk. As we analytically show, the split strategy typically used in dynamic index structures, splitting the data space at the 50%-quantile, results in a bad query performance in high-dimensional spaces. Therefore, we u...
Stefan Berchtold, Christian Böhm, Hans-Peter
Added 05 Aug 2010
Updated 05 Aug 2010
Type Conference
Year 1998
Where EDBT
Authors Stefan Berchtold, Christian Böhm, Hans-Peter Kriegel
Comments (0)