Sciweavers

SIGIR
2006
ACM

Feature diversity in cluster ensembles for robust document clustering

13 years 10 months ago
Feature diversity in cluster ensembles for robust document clustering
The performance of document clustering systems depends on employing optimal text representations, which are not only difficult to determine beforehand, but also may vary from one clustering problem to another. As a first step towards building robust document clusterers, a strategy based on feature diversity and cluster ensembles is presented in this work. Experiments conducted on a binary clustering problem show that our method is robust to near-optimal model order selection and able to detect constructive interactions between different document representations in the test bed. Categories and Subject Descriptors I.2.7 [Artificial Intelligence]: Natural Language Processing—Text Analysis; I.5.3 [Pattern Recognition]: Clustering—Algorithms General Terms Algorithms, Design, Experimentation, Performance Keywords Document clustering, feature extraction, cluster ensembles
Xavier Sevillano, Germán Cobo, Francesc Al&
Added 14 Jun 2010
Updated 14 Jun 2010
Type Conference
Year 2006
Where SIGIR
Authors Xavier Sevillano, Germán Cobo, Francesc Alías, Joan Claudi Socoró
Comments (0)