Feature diversity in cluster ensembles for robust document clustering

The performance of document clustering systems depends on employing optimal text representations, which are not only difficult to determine beforehand, but also may vary from one clustering problem to another. As a first step towards building robust document clusterers, a strategy based on feature diversity and cluster ensembles is presented in this work. Experiments conducted on a binary clustering problem show that our method is robust to near-optimal model order selection and able to detect constructive interactions between different document representations in the test bed.

Categories and Subject Descriptors: I.2.7 [Artificial Intelligence]: Natural Language Processing - Text Analysis; I.5.3 [Pattern Recognition]: Clustering - Algorithms

General Terms: Algorithms, Design, Experimentation, Performance

Keywords: Document clustering, feature extraction, cluster ensembles

Xavier Sevillano, Germán Cobo, Francesc Alías, Joan Claudi Socoró

Clustering Problem | Document Clustering | Document Clustering Systems | SIGIR 2006 |

» Consensus Clusterings

» Random Projection for High Dimensional Data Clustering: A Cluster Ensemble Approach

» A Hierarchical Consensus Architecture for Robust Document Clustering

» Selecting Diversifying Heuristics for Cluster Ensembles

» Graph Clustering-Based Ensemble Method for Handwritten Text Line Segmentation

» Document Clustering via Dirichlet Process Mixture Model with Feature Selection

» Registering a MultiSensor Ensemble of Images

» Cluster Ensembles: A Knowledge Reuse Framework for Combining Multiple Partitions

Post Info

Added	14 Jun 2010
Updated	14 Jun 2010
Type	Conference
Year	2006
Where	SIGIR
Authors	Xavier Sevillano, Germán Cobo, Francesc Alías, Joan Claudi Socoró
