Parallelizing the Data Cube

15 years 5 months ago

Download users.encs.concordia.ca

This paper presents a general methodology for the efﬁcient parallelization of existing data cube construction algorithms. We describe two different partitioning strategies, one for top-down and one for bottomup cube algorithms. Both partitioning strategies assign subcubes to individual processors in such a way that the loads assigned to the processors are balanced. Our methods reduce inter processor communication overhead by partitioning the load in advance instead of computing each individual group-by in parallel. Our partitioning strategies create a small number of coarse tasks. This allows for sharing of preﬁxes and sort orders between different group-by computations. Our methods enable code reuse by permitting the use of existing sequential (external memory) data cube algorithms for the subcube computations on each processor. This supports the transfer of optimized sequential data cube code to a parallel setting. The bottom-up partitioning strategy balances the number of single...

Frank K. H. A. Dehne, Todd Eavis, Susanne E. Hambr

Real-time Traffic

Data Cube | Data Cube Construction | Database | ICDT 2001 | Partitioning Strategies |

claim paper

Related Content

» The Computation of Semantic Data Cube

» High Performance Data Mining Using Data Cubes on Parallel Computers

» Improved Data Partitioning for Building Large ROLAP Data Cubes in Parallel

» Parallel ROLAP Data Cube Construction On SharedNothing Multiprocessors

» cgmOLAP Efficient Parallel Generation and Querying of Terabyte Size ROLAP Data Cubes

» Communication and Memory Optimal Parallel Data Cube Construction

» Data Cube Materialization and Mining over MapReduce

» Parallel querying of ROLAP cubes in the presence of hierarchies

» Compressing Data Cube in Parallel OLAP Systems

Post Info
More Details (n/a)

Added	29 Jul 2010
Updated	29 Jul 2010
Type	Conference
Year	2001
Where	ICDT
Authors	Frank K. H. A. Dehne, Todd Eavis, Susanne E. Hambrusch, Andrew Rau-Chaplin

Comments (0)

Sciweavers

Parallelizing the Data Cube

Data Cube | Data Cube Construction | Database | ICDT 2001 | Partitioning Strategies |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers