Sciweavers

» A data mining approach to database compression
PKDD
1999
Springer
15 years 2 months ago
An Evolutionary Algorithm Using Multivariate Discretization for Decision Rule Induction
Abstract. We describe EDRL-MD, an evolutionary algorithm-based system, for learning decision rules from databases. The main novelty of our approach lies in dealing with continuous ...
Wojciech Kwedlo, Marek Kretowski
NIPS
2007
14 years 11 months ago
Mining Internet-Scale Software Repositories
Large repositories of source code create new challenges and opportunities for statistical machine learning. Here we first develop Sourcerer, an infrastructure for the automated c...
Erik Linstead, Paul Rigor, Sushil Krishna Bajracha...
KDD
2005
ACM
15 years 3 months ago
Cross-relational clustering with user's guidance
Clustering is an essential data mining task with numerous applications. However, data in most real-life applications are high-dimensional in nature, and the related information of...
Xiaoxin Yin, Jiawei Han, Philip S. Yu
IQ
2007
14 years 11 months ago
Rule-Based Measurement Of Data Quality In Nominal Data
Sufficiently high data quality is crucial for almost every application. Nonetheless, data quality issues are nearly omnipresent. The reasons for poor quality cannot simply be bla...
Jochen Hipp, Markus Müller, Johannes Hohendor...
ICDE
2009
IEEE
16 years 9 months ago
FF-Anonymity: When Quasi-Identifiers Are Missing
Existing approaches on privacy-preserving data publishing rely on the assumption that data can be divided into quasi-identifier attributes (QI) and sensitive attribute (SA). This ...
Ada Wai-Chee Fu, Ke Wang, Raymond Chi-Wing Wong, Y...