Sciweavers

436 search results - page 6 / 88
» Estimating the Quality of Data in Relational Databases
DATASCIENCE
2006
14 years 9 months ago
Towards development of a high quality public domain global roads database
There is clear demand for a global spatial public domain roads data set with improved geographic and temporal coverage, consistent coding of road types, and clear documentation of...
Andrew Nelson 0002, Alexander de Sherbinin, France...
SIGMOD
2005
ACM
15 years 9 months ago
Relational Confidence Bounds Are Easy With The Bootstrap
Statistical estimation and approximate query processing have become increasingly prevalent applications for database systems. However, approximation is usually of little use witho...
Abhijit Pol, Chris Jermaine
VLDB
1995
ACM
15 years 1 month ago
Sampling-Based Estimation of the Number of Distinct Values of an Attribute
We provide several new sampling-based estimators of the number of distinct values of an attribute in a relation. We compare these new estimators to estimators from the database an...
Peter J. Haas, Jeffrey F. Naughton, S. Seshadri, L...
ER
2008
Springer
14 years 11 months ago
Towards a Compositional Semantic Account of Data Quality Attributes
We address the fundamental question: what does it mean for data in a database to be of high quality? We motivate our discussion with examples, where traditional views on data quali...
Lei Jiang, Alexander Borgida, John Mylopoulos
COLT
2000
Springer
15 years 2 months ago
Model Selection and Error Estimation
We study model selection strategies based on penalized empirical loss minimization. We point out a tight relationship between error estimation and data-based complexity penalizatio...
Peter L. Bartlett, Stéphane Boucheron, Gá...