Sciweavers

ICDT
2001
ACM

Mining for Empty Rectangles in Large Data Sets

13 years 9 months ago
Mining for Empty Rectangles in Large Data Sets
Abstract. Many data mining approaches focus on the discovery of similar (and frequent) data values in large data sets. We present an alternative, but complementary approach in which we search for empty regions in the data. We consider the problem of finding all maximal empty rectangles in large, two-dimensional data sets. We introduce a novel, scalable algorithm for finding all such rectangles. The algorithm achieves this with a single scan over a sorted data set and requires only a small bounded amount of memory. We also describe an algorithm to find all maximal empty hyper-rectangles in a multi-dimensional space. We consider the complexity of this search problem and present new bounds on the number of maximal empty hyper-rectangles. We briefly overview experimental results obtained by applying our algorithm to a synthetic data set.
Jeff Edmonds, Jarek Gryz, Dongming Liang, Ren&eacu
Added 29 Jul 2010
Updated 29 Jul 2010
Type Conference
Year 2001
Where ICDT
Authors Jeff Edmonds, Jarek Gryz, Dongming Liang, Renée J. Miller
Comments (0)