Sciweavers

3879 search results - page 678 / 776
» PyPBS design and methodologies
Sort
View
148
Voted
KDD
2005
ACM
137views Data Mining» more  KDD 2005»
16 years 5 months ago
Pattern-based similarity search for microarray data
One fundamental task in near-neighbor search as well as other similarity matching efforts is to find a distance function that can efficiently quantify the similarity between two o...
Haixun Wang, Jian Pei, Philip S. Yu
KDD
2004
ACM
302views Data Mining» more  KDD 2004»
16 years 5 months ago
Redundancy based feature selection for microarray data
In gene expression microarray data analysis, selecting a small number of discriminative genes from thousands of genes is an important problem for accurate classification of diseas...
Lei Yu, Huan Liu
KDD
2003
ACM
152views Data Mining» more  KDD 2003»
16 years 5 months ago
Interactive exploration of coherent patterns in time-series gene expression data
Discovering coherent gene expression patterns in time-series gene expression data is an important task in bioinformatics research and biomedical applications. In this paper, we pr...
Daxin Jiang, Jian Pei, Aidong Zhang
KDD
2003
ACM
135views Data Mining» more  KDD 2003»
16 years 5 months ago
Efficiently handling feature redundancy in high-dimensional data
High-dimensional data poses a severe challenge for data mining. Feature selection is a frequently used technique in preprocessing high-dimensional data for successful data mining....
Lei Yu, Huan Liu
KDD
2002
ACM
126views Data Mining» more  KDD 2002»
16 years 5 months ago
Integrating feature and instance selection for text classification
Instance selection and feature selection are two orthogonal methods for reducing the amount and complexity of data. Feature selection aims at the reduction of redundant features i...
Dimitris Fragoudis, Dimitris Meretakis, Spiros Lik...