Sciweavers

4660 search results - page 232 / 932
» Learning from imperfect data
SIGMOD
2010
ACM
15 years 5 months ago
GDR: a system for guided data repair
Improving data quality is a time-consuming, labor-intensive, and often domain-specific operation. Existing data repair approaches are either fully automated or not efficient in int...
Mohamed Yakout, Ahmed K. Elmagarmid, Jennifer Nevi...
ICDM
2010
IEEE
15 years 3 months ago
A Privacy Preserving Framework for Gaussian Mixture Models
This paper presents a framework for privacy-preserving Gaussian Mixture Model computations. Specifically, we consider a scenario where a central service wants to learn the...
Madhusudana Shashanka
KDD
2008
ACM
16 years 5 months ago
Active learning with direct query construction
Active learning may hold the key for solving the data scarcity problem in supervised learning, i.e., the lack of labeled data. Indeed, labeling data is a costly process, yet an ac...
Charles X. Ling, Jun Du
KDD
2010
ACM
15 years 7 months ago
Large linear classification when data cannot fit in memory
Recent advances in linear classification have shown that for applications such as document classification, the training can be extremely efficient. However, most of the existing t...
Hsiang-Fu Yu, Cho-Jui Hsieh, Kai-Wei Chang, Chih-J...
KDD
1995
ACM
15 years 8 months ago
Feature Extraction for Massive Data Mining
Techniques for learning from data typically require data to be in standard form. Measurements must be encoded in a numerical format such as binary true-or-false features, numerica...
V. Seshadri, Raguram Sasisekharan, Sholom M. Weiss