Sciweavers

SSDBM
2008
IEEE

Summarizing Two-Dimensional Data with Skyline-Based Statistical Descriptors

13 years 10 months ago
Summarizing Two-Dimensional Data with Skyline-Based Statistical Descriptors
Much real data consists of more than one dimension, such as financial transactions (eg, price × volume) and IP network flows (eg, duration × numBytes), and capture relationships between the variables. For a single dimension, quantiles are intuitive and robust descriptors. Processing and analyzing such data, particularly in data warehouse or data streaming settings, requires similarly robust and informative statistical descriptors that go beyond one-dimension. Applying quantile methods to summarize a multidimensional distribution along only singleton attributes ignores the rich dependence amongst the variables. In this paper, we present new skyline-based statistical descriptors for capturing the distributions over pairs of dimensions. They generalize the notion of quantiles in the individual dimensions, and also incorporate properties of the joint distribution. We introduce φ-quantours and α-radials, which are skyline points over subsets of the data, and propose (φ, α)-quantiles...
Graham Cormode, Flip Korn, S. Muthukrishnan, Dives
Added 01 Jun 2010
Updated 01 Jun 2010
Type Conference
Year 2008
Where SSDBM
Authors Graham Cormode, Flip Korn, S. Muthukrishnan, Divesh Srivastava
Comments (0)