Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

76

PODS
2005
ACM

favoriteEmaildiscussreport

115views Database» more PODS 2005»

A divide-and-merge methodology for clustering

16 years 1 months ago

A divide-and-merge methodology for clustering

Download www.cs.yale.edu

We present a divide-and-merge methodology for clustering a set of objects that combines a top-down "divide" phase with a bottom-up "merge" phase. In contrast, previous algorithms use either top-down or bottom-up methods for constructing a hierarchical clustering or produce a flat clustering using local search (e.g. k-means). Our divide phase produces a tree whose leaves are the elements of the set. For this phase, we suggest an efficient spectral algorithm. The merge phase quickly finds the optimal partition that respects the tree for many natural objective functions, e.g., k-means, min-diameter, min-sum, correlation clustering, etc. We present a metasearch engine that clusters results from web searches. We also give empirical results on textbased data where the algorithm performs better than or competitively with existing clustering algorithms.

David Cheng, Santosh Vempala, Ravi Kannan, Grant W

Real-time Traffic

Database | Divide Phase | Hierarchical Clustering | Merge Phase | PODS 2005 |

claim paper

Related Content

» A Methodology for Analyzing Case Retrieval from a Clustered Case Memory

» Mining Structural Databases An Evolutionary MultiObjetive Conceptual Clustering Methodolog...

» Adaptive Automata Community Detection and Clustering A generic methodology

» A methodology for clustering XML documents by structure

» Clustering methodologies for identifying country core competencies

» A Performance Prediction Methodology for Datadependent Parallel Applications

» An improved methodology on information distillation by mining program source code

» KMutual Nearest Neighbour Approach for Clustering TwoDimensional Shapes Described by Fuzzy...

» Incrementally Assessing Cluster Tendencies with a Maximum Variance Cluster Algorithm

Post Info
More Details (n/a)

Added	08 Dec 2009
Updated	08 Dec 2009
Type	Conference
Year	2005
Where	PODS
Authors	David Cheng, Santosh Vempala, Ravi Kannan, Grant Wang

Comments (0)