Sciweavers

CORIA
2008

Involving Validity Indices in Document Clustering

13 years 6 months ago
Involving Validity Indices in Document Clustering
The goal of any clustering algorithm is to find the optimal clustering solution with the optimal number of clusters. In order to evaluate a clustering solution, a number of validity indices are used during or at the end of a clustering process. They can be internal, external or relative. In this paper, we provide two main contributions: First, we present an experimental study comparing the major relative indices in the context of document agglomerative clustering. The objective is to highlight the limits of the existing indices for identifying both the optimal clustering solution and the optimal number of clusters in real datasets. Second, we explore the feasibility of using the relative indices as stopping criteria in agglomerative clustering algorithms. We present a new method that enhances the clustering process with context-awareness to improve their reliability for such utilization. MOTS-CL
Ahmad El Sayed, Hakim Hacid, Djamel A. Zighed
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2008
Where CORIA
Authors Ahmad El Sayed, Hakim Hacid, Djamel A. Zighed
Comments (0)