Aggregation in Probabilistic Databases via Knowledge Compilation

Download www.robertfink.de

This paper presents a query evaluation technique for positive relational algebra queries with aggregates on a representation system for probabilistic data based on the algebraic structures of semiring and semimodule. The core of our evaluation technique is a procedure that compiles semimodule and semiring expressions into so-called decomposition trees, for which the computation of the probability distribution can be done in time linear in the product of the sizes of the probability distributions represented by its nodes. We give syntactic characterisations of tractable queries with aggregates by exploiting the connection between query tractability and polynomial-time decomposition trees. A prototype of the technique is incorporated in the probabilistic database engine SPROUT. We report on performance experiments with custom datasets and TPC-H data.
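The cost bound in the abstract can be illustrated with a small sketch. It is not the paper's decomposition-tree procedure or SPROUT code. It only assumes a simplified setting in which every tuple is present independently with a given probability, and the names `combine` and `tuple_contribution` are hypothetical. Combining the distributions of two independent sub-expressions visits every pair of values, so each step costs the product of the two distribution sizes.

```python
from collections import defaultdict
from operator import add

def combine(dist_a, dist_b, op):
    """Distribution of op(A, B) for independent random variables A and B.

    Each distribution is a dict mapping value -> probability. The nested
    loop visits len(dist_a) * len(dist_b) pairs, i.e. the product of the
    sizes of the two input distributions.
    """
    out = defaultdict(float)
    for va, pa in dist_a.items():
        for vb, pb in dist_b.items():
            out[op(va, vb)] += pa * pb
    return dict(out)

def tuple_contribution(p, value, neutral):
    """Distribution contributed by one tuple that is present with probability p.

    If the tuple is absent, it contributes the neutral element of the
    aggregate (0 for SUM, +infinity for MIN). Assumption: tuples are
    independent of each other.
    """
    out = defaultdict(float)
    out[value] += p
    out[neutral] += 1.0 - p
    return dict(out)

# Illustrative data (not from the paper): (presence probability, value).
tuples = [(0.5, 10), (0.5, 20), (0.5, 30)]

# SUM aggregate: start from the neutral element 0 and fold in each tuple.
sum_dist = {0: 1.0}
for p, v in tuples:
    sum_dist = combine(sum_dist, tuple_contribution(p, v, 0), add)

# MIN aggregate: same fold with min and neutral element +infinity.
INF = float("inf")
min_dist = {INF: 1.0}
for p, v in tuples:
    min_dist = combine(min_dist, tuple_contribution(p, v, INF), min)
```

In this sketch the SUM distribution can grow with every tuple, while the MIN distribution never holds more values than there are distinct inputs plus the neutral element, which hints at why the size of the intermediate distributions matters for tractability.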

Robert Fink, Larisa Han, Dan Olteanu

CORR 2012 | Custom Datasets | Education | Polynomial Time | Probability Distributions |

Added	20 Apr 2012
Updated	20 Apr 2012
Type	Journal
Year	2012
Where	CORR
Authors	Robert Fink, Larisa Han, Dan Olteanu
