SPROUT: Lazy vs. Eager Query Plans for Tuple-Independent Probabilistic Databases

13 years 11 months ago

Download www.comlab.ox.ac.uk

— A paramount challenge in probabilistic databases is the scalable computation of conﬁdences of tuples in query results. This paper introduces an efﬁcient secondary-storage operator for exact computation of queries on tuple-independent probabilistic databases. We consider the conjunctive queries without self-joins that are known to be tractable on any tupleindependent database, and queries that are not tractable in general but become tractable on probabilistic databases restricted by functional dependencies. Our operator is semantically equivalent to a sequence of aggregations and can be naturally integrated into existing relational query plans. As a proof of concept, we developed an extension of the PostgreSQL 8.3.3 query engine called SPROUT. We study optimizations that push or pull our operator or parts thereof past joins. The operator employs static information, such as the query structure and functional dependencies, to decide which constituent aggregations can be evaluated ...

Dan Olteanu, Jiewen Huang, Christoph Koch

Real-time Traffic

Database | ICDE 2009 | Operator | Probabilistic Databases | Tuple-independent Probabilistic Databases |

claim paper

Added	19 May 2010
Updated	19 May 2010
Type	Conference
Year	2009
Where	ICDE
Authors	Dan Olteanu, Jiewen Huang, Christoph Koch

Sciweavers

SPROUT: Lazy vs. Eager Query Plans for Tuple-Independent Probabilistic Databases

Database | ICDE 2009 | Operator | Probabilistic Databases | Tuple-independent Probabilistic Databases |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers