KDD
2003
ACM
Using randomized response techniques for privacy-preserving data mining
Privacy is an important issue in data mining and knowledge discovery. In this paper, we propose to use the randomized response techniques to conduct the data mining computation. S...
Wenliang Du, Zhijun Zhan
KDD
2003
ACM
Generative model-based clustering of directional data
High dimensional directional data is becoming increasingly important in contemporary applications such as analysis of text and gene-expression data. A natural model for multivaria...
Arindam Banerjee, Inderjit S. Dhillon, Joydeep Gho...
KDD
2003
ACM
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney
KDD
2003
ACM
A highly-usable projected clustering algorithm for gene expression profiles
Projected clustering has become a hot research topic due to its ability to cluster high-dimensional data. However, most existing projected clustering algorithms depend on some cri...
Kevin Y. Yip, David W. Cheung, Michael K. Ng
KDD
2003
ACM
Enhanced visualization of time series through higher Fourier harmonics
Li Zhang, Aidong Zhang, Murali Ramanathan
KDD
2003
ACM
Interactive Analysis of Gene Interactions Using Graphical Gaussian Model
DNA microarray provides a powerful basis for analysis of gene expression. Data mining methods such as clustering have been widely applied to microarray data to link genes that sho...
Xintao Wu, Yong Ye, Kalpathi R. Subramanian
KDD
2003
ACM
Extracting information from text and images for location proteomics
There is extensive interest in automating the collection, organization and summarization of biological data. Data in the form of figures and accompanying captions in literature pr...
Zhenzhen Kou, William W. Cohen, Robert F. Murphy
KDD
2003
ACM
Distance-enhanced association rules for gene expression
We introduce a novel data mining technique for the analysis of gene expression. Gene expression is the effective production of the protein that a gene encodes. We focus on the cha...
Aleksandar Icev, Carolina Ruiz, Elizabeth F. Ryder