SIGMOD
2010
ACM

Hierarchically organized skew-tolerant histograms for geographic data objects

Histograms have been widely used for fast estimation of query result sizes in query optimization. In this paper, we propose a new histogram method, called the Skew-Tolerant Histogram (STHistogram), for two- or three-dimensional geographic data objects, which are used in many real-world applications. The proposed method provides significantly enhanced accuracy in a robust manner, even for data sets with highly skewed distributions. Our method detects hotspots present in various parts of a data set and exploits them in organizing histogram buckets. For this purpose, we first define the concept of a hotspot and provide an algorithm that efficiently extracts hotspots from a given data set. Then, we present our histogram construction method, which utilizes hotspot information. We also describe how to estimate query result sizes using the proposed histogram. We show through extensive performance experiments that the proposed method provides better performance than other...
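To make the idea of histogram-based result size estimation concrete, here is a minimal sketch of a plain equi-width grid histogram for 2D points. It is not the STHistogram described in the abstract: it has no hotspot detection and no hierarchy, and the names `Bucket`, `build_grid_histogram`, and `estimate_count` are hypothetical, chosen only for illustration. It assumes points lie inside the given bounds and that points are uniformly distributed within each bucket.

```python
from dataclasses import dataclass


@dataclass
class Bucket:
    # Axis-aligned rectangle from (x0, y0) to (x1, y1) and its point count.
    x0: float
    y0: float
    x1: float
    y1: float
    count: int = 0


def build_grid_histogram(points, bounds, nx, ny):
    """Count points into an nx-by-ny equi-width grid over bounds=(x0, y0, x1, y1).

    Assumption: every point lies inside bounds.
    """
    bx0, by0, bx1, by1 = bounds
    w = (bx1 - bx0) / nx
    h = (by1 - by0) / ny
    buckets = [
        Bucket(bx0 + i * w, by0 + j * h, bx0 + (i + 1) * w, by0 + (j + 1) * h)
        for j in range(ny)
        for i in range(nx)
    ]
    for x, y in points:
        # Clamp so that points on the upper boundary fall into the last cell.
        i = min(int((x - bx0) / w), nx - 1)
        j = min(int((y - by0) / h), ny - 1)
        buckets[j * nx + i].count += 1
    return buckets


def estimate_count(buckets, query):
    """Estimate how many points fall in query=(x0, y0, x1, y1).

    Each bucket contributes its count scaled by the fraction of its area
    that the query rectangle covers (uniformity assumption per bucket).
    """
    qx0, qy0, qx1, qy1 = query
    total = 0.0
    for b in buckets:
        overlap_x = max(0.0, min(b.x1, qx1) - max(b.x0, qx0))
        overlap_y = max(0.0, min(b.y1, qy1) - max(b.y0, qy0))
        area = (b.x1 - b.x0) * (b.y1 - b.y0)
        total += b.count * (overlap_x * overlap_y) / area
    return total


# Small, made-up data set with three of four points in one cell.
points = [(0.5, 0.5), (0.5, 0.5), (1.5, 1.5), (0.1, 0.1)]
hist = build_grid_histogram(points, bounds=(0.0, 0.0, 2.0, 2.0), nx=2, ny=2)
print(estimate_count(hist, (0.0, 0.0, 1.0, 1.0)))  # query aligned with one bucket
print(estimate_count(hist, (0.0, 0.0, 0.5, 0.5)))  # query covering a quarter of it
```

The second query shows where the uniformity assumption can go wrong: the estimate depends only on the covered area, not on where the points actually sit inside the bucket. With heavily skewed data this gap can grow, which is the kind of situation a skew-tolerant bucket organization is meant to handle.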
Added 06 Dec 2010
Updated 06 Dec 2010
Type Conference
Year 2010
Where SIGMOD
Authors Yohan J. Roh, Jae Ho Kim, Yon Dohn Chung, Jin Hyun Son, Myoung-Ho Kim