Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

75

KDD
2004
ACM

favoriteEmaildiscussreport

103views Data Mining» more KDD 2004»

An objective evaluation criterion for clustering

15 years 10 months ago

An objective evaluation criterion for clustering

Download hunch.net

We propose and test an objective criterion for evaluation of clustering performance: How well does a clustering algorithm run on unlabeled data aid a classification algorithm? The accuracy is quantified using the PAC-MDL bound [3] in a semisupervised setting. Clustering algorithms which naturally separate the data according to (hidden) labels with a small number of clusters perform well. A simple extension of the argument leads to an objective model selection method. Experimental results on text analysis datasets demonstrate that this approach empirically results in very competitive bounds on test set performance on natural datasets. Categories and Subject Descriptors: I.5.3 [Pattern Recognition]: Clustering

Arindam Banerjee, John Langford

Real-time Traffic

Clustering Algorithm | Data Mining | KDD 2004 | Text Analysis Datasets | Unlabeled Data Aid |

claim paper

Related Content

» Clustering Transactions Using Large Items

» Soft clustering criterion functions for partitional document clustering a summary of resul...

» Performance Assessment of Some Clustering Algorithms Based on a Fuzzy GranulationDegranula...

» A monothetic clustering method

» Clustering Multidimensional Extended Objects to Speed Up Execution of Spatial Queries

» Speaker Diarization Exploiting the Eigengap Criterion and Cluster Ensembles

» Evaluation of Topographic Clustering and Its Kernelization

» A mutual information based approach for evaluating the quality of clustering

» Evaluation of Clustering Algorithms for Polish Word Sense Disambiguation

Post Info
More Details (n/a)

Added	30 Nov 2009
Updated	30 Nov 2009
Type	Conference
Year	2004
Where	KDD
Authors	Arindam Banerjee, John Langford

Comments (0)