Sciweavers

PVLDB
2010

Data Auditor: Exploring Data Quality and Semantics using Pattern Tableaux

13 years 2 months ago
Data Auditor: Exploring Data Quality and Semantics using Pattern Tableaux
We present Data Auditor, a tool for exploring data quality and data semantics. Given a rule or an integrity constraint and a target relation, Data Auditor computes pattern tableaux, which concisely summarize subsets of the relation that (mostly) satisfy or (mostly) fail the constraint. This paper describes 1) the architecture and user interface of Data Auditor, 2) the supported constraints for testing data consistency and completeness, 3) the heuristics used by Data Auditor to “tune” a given constraint or its associated parameters for better fit with the data, and 4) several demonstration scenarios. using real data sets.
Lukasz Golab, Howard J. Karloff, Flip Korn, Divesh
Added 30 Jan 2011
Updated 30 Jan 2011
Type Journal
Year 2010
Where PVLDB
Authors Lukasz Golab, Howard J. Karloff, Flip Korn, Divesh Srivastava
Comments (0)