Computing Iceberg Queries Efficiently

Many applications compute aggregate functions over an attribute (or set of attributes) to find aggregate values above some specified threshold. We call such queries iceberg queries, because the number of above-threshold results is often very small (the tip of an iceberg) relative to the large amount of input data (the iceberg). Iceberg queries are common in many applications, including data warehousing, information retrieval, market basket analysis in data mining, clustering, and copy detection. We propose efficient algorithms that evaluate iceberg queries using very little memory and significantly fewer passes over the data than current techniques based on sorting or hashing. We present an experimental case study using over three gigabytes of Web data to illustrate the savings obtained by our algorithms.
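As a rough illustration of what such a query computes, the Python sketch below finds every value whose count reaches a threshold (in SQL terms, roughly a GROUP BY with a HAVING COUNT(*) >= T clause). The function names `iceberg_naive` and `iceberg_coarse_count`, the bucket count, and the sample data are all hypothetical. The second function shows a generic two-pass, hash-bucket pruning idea; it is a sketch under those assumptions and is not presented as the algorithms proposed in the paper.

```python
from collections import Counter
import zlib


def iceberg_naive(items, threshold):
    """Count every distinct item exactly; memory grows with the number of distinct items."""
    counts = Counter(items)
    return {item: c for item, c in counts.items() if c >= threshold}


def _bucket(item, num_buckets):
    # Deterministic hash (crc32) so bucket assignment is reproducible across runs.
    return zlib.crc32(str(item).encode("utf-8")) % num_buckets


def iceberg_coarse_count(items, threshold, num_buckets=1024):
    """Two-pass sketch: coarse bucket counts first, exact counts only for candidates."""
    items = list(items)  # the data is scanned twice

    # Pass 1: keep one counter per hash bucket, not per distinct item.
    buckets = [0] * num_buckets
    for item in items:
        buckets[_bucket(item, num_buckets)] += 1

    # Pass 2: count exactly only the items that hash to a "heavy" bucket.
    # An item with count >= threshold always lands in a heavy bucket, so no
    # true result is lost; light items sharing that bucket are false
    # candidates and are filtered out by the final exact check.
    exact = Counter(
        item for item in items
        if buckets[_bucket(item, num_buckets)] >= threshold
    )
    return {item: c for item, c in exact.items() if c >= threshold}


# Hypothetical sample data: only "a" and "b" occur at least 3 times.
data = ["a"] * 5 + ["b"] * 3 + ["c", "d", "e"]
result = iceberg_coarse_count(data, threshold=3)
```

Both functions return the same answer; the difference is that the coarse-counting variant bounds the memory of its first pass by `num_buckets` at the cost of a second scan.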

Min Fang, Narayanan Shivakumar, Hector Garcia-Molina, Rajeev Motwani, Jeffrey D. Ullman

Aggregate Functions | Database | Iceberg Queries | VLDB 1998

» Finding global icebergs over distributed data sets

» Iceberg Query Lattices for Datalog

» Computing iceberg concept lattices with T

» C-Cubing: Efficient Computation of Closed Cubes by Aggregation-Based Checking

» cgmOLAP: Efficient Parallel Generation and Querying of Terabyte Size ROLAP Data Cubes

» IX-Cubes: Iceberg Cubes for Data Warehousing and OLAP on XML Data

» Constructing Iceberg Lattices from Frequent Closures Using Generators

» BitCube: A Bottom-Up Cubing Engineering

Post Info

Added	06 Aug 2010
Updated	06 Aug 2010
Type	Conference
Year	1998
Where	VLDB
Authors	Min Fang, Narayanan Shivakumar, Hector Garcia-Molina, Rajeev Motwani, Jeffrey D. Ullman
