Sciweavers

1435 search results - page 167 / 287
» Generalization Error Bounds Using Unlabeled Data
Sort
View
77
Voted
UIST
2010
ACM
14 years 8 months ago
Designing adaptive feedback for improving data entry accuracy
Data quality is critical for many information-intensive applications. One of the best opportunities to improve data quality is during entry. USHER provides a theoretical, data-dri...
Kuang Chen, Joseph M. Hellerstein, Tapan S. Parikh
SIGMOD
2011
ACM
205views Database» more  SIGMOD 2011»
14 years 28 days ago
Interaction between record matching and data repairing
Central to a data cleaning system are record matching and data repairing. Matching aims to identify tuples that refer to the same real-world object, and repairing is to make a dat...
Wenfei Fan, Jianzhong Li, Shuai Ma, Nan Tang, Weny...
TIT
2002
86views more  TIT 2002»
14 years 9 months ago
Lagrangian empirical design of variable-rate vector quantizers: consistency and convergence rates
Abstract--The Lagrangian formulation of variable-rate vector quantization is known to yield useful necessary conditions for quantizer optimality and generalized Lloyd algorithms fo...
Tamás Linder
SIGMOD
2002
ACM
198views Database» more  SIGMOD 2002»
15 years 10 months ago
Processing complex aggregate queries over data streams
Recent years have witnessed an increasing interest in designing algorithms for querying and analyzing streaming data (i.e., data that is seen only once in a fixed order) with only...
Alin Dobra, Minos N. Garofalakis, Johannes Gehrke,...
IDEAL
2004
Springer
15 years 3 months ago
Combining Local and Global Models to Capture Fast and Slow Dynamics in Time Series Data
Many time series exhibit dynamics over vastly different time scales. The standard way to capture this behavior is to assume that the slow dynamics are a “trend”, to de-trend t...
Michael Small