Sciweavers

IMSCCS
2006
IEEE

Clustering of Gene Expression Data: Performance and Similarity Analysis

13 years 10 months ago
Clustering of Gene Expression Data: Performance and Similarity Analysis
Background: DNA Microarray technology is an innovative methodology in experimental molecular biology, which has produced huge amounts of valuable data in the profile of gene expression. Many clustering algorithms have been proposed to analyze gene expression data, but little guidance is available to help choose among them. The evaluation of feasible and applicable clustering algorithms is becoming an important issue in today's bioinformatics research. Results: In this paper we first experimentally study three major clustering algorithms: Hierarchical Clustering (HC), Self-Organizing Map (SOM), and Self Organizing Tree Algorithm (SOTA) using Yeast Saccharomyces cerevisiae gene expression data, and compare their performance. We then introduce Cluster Diff, a new data mining tool, to conduct the similarity analysis of clusters generated by different algorithms. The performance study shows that SOTA is more efficient than SOM while HC is the least efficient. The results of similarity...
Longde Yin, Chun-Hsi Huang
Added 11 Jun 2010
Updated 11 Jun 2010
Type Conference
Year 2006
Where IMSCCS
Authors Longde Yin, Chun-Hsi Huang
Comments (0)