Clusterer ensemble

9 years 2 months ago
Clusterer ensemble
Ensemble methods that train multiple learners and then combine their predictions have been shown to be very effective in supervised learning. This paper explores ensemble methods for unsupervised learning. Here an ensemble comprises multiple clusterers, each of which is trained by k-means algorithm with different initial points. The clusters discovered by different clusterers are aligned, i.e. similar clusters are assigned with the same label, by counting their overlapped data items. Then, four methods are developed to combine the aligned clusterers. Experiments show that clustering performance could be significantly improved by ensemble methods, where utilizing mutual information to select a subset of clusterers for weighted voting is a nice choice. Since the proposed methods work by analyzing the clustering results instead of the internal mechanisms of the component clusterers, they are applicable to diverse kinds of clustering algorithms.
Zhi-Hua Zhou, Wei Tang
Added 13 Dec 2010
Updated 13 Dec 2010
Type Journal
Year 2006
Where KBS
Authors Zhi-Hua Zhou, Wei Tang
Comments (0)