A fast k-means implementation using coresets


Download www.frahling.de

In this paper we develop an efficient implementation for a k-means clustering algorithm. The novel feature of our algorithm is that it uses coresets to speed up the algorithm. A coreset is a small weighted set of points that approximates the original point set with respect to the considered problem. The main strength of the algorithm is that it can quickly determine clusterings of the same point set for many values of k. This is necessary in many applications, since, typically, one does not know a good value for k in advance. Once we have clusterings for many different values of k we can determine a good choice of k using a quality measure of clusterings that is independent of k, for example the average silhouette coefficient. The average silhouette coefficient can be approximated using coresets. To evaluate the performance of our algorithm we compare it with algorithm KMHybrid [28] on typical 3D data sets for an image compression application and on artificially created instances....
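
The workflow described in the abstract (build a coreset once, cluster it for many values of k, then pick k by the average silhouette coefficient) can be illustrated with a minimal Python sketch. Everything below is an assumption made for illustration and is not taken from the paper: the grid-based coreset, the farthest-first seeding, and the function names `grid_coreset`, `weighted_kmeans`, `weighted_silhouette`, and `choose_k` are all hypothetical. The silhouette is computed by treating a coreset point of weight w as w copies at the same location.

```python
import math
from collections import defaultdict

def grid_coreset(points, cell):
    """Hypothetical coreset: one weighted centroid per occupied grid cell."""
    cells = defaultdict(list)
    for p in points:
        cells[tuple(int(x // cell) for x in p)].append(p)
    return [(tuple(sum(c) / len(m) for c in zip(*m)), float(len(m)))
            for m in cells.values()]

def dist(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def assign(coreset, centers):
    """Index of the nearest center for every coreset point."""
    return [min(range(len(centers)), key=lambda j: dist(p, centers[j]))
            for p, _ in coreset]

def weighted_kmeans(coreset, k, iters=100):
    """Lloyd's algorithm on weighted points (farthest-first seeding is an assumption)."""
    centers = [coreset[0][0]]
    while len(centers) < k:
        centers.append(max((p for p, _ in coreset),
                           key=lambda p: min(dist(p, c) for c in centers)))
    for _ in range(iters):
        labels = assign(coreset, centers)
        updated = []
        for j in range(k):
            members = [(p, w) for (p, w), l in zip(coreset, labels) if l == j]
            if not members:
                updated.append(centers[j])  # keep the center of an empty cluster
                continue
            total = sum(w for _, w in members)
            updated.append(tuple(sum(w * p[d] for p, w in members) / total
                                 for d in range(len(centers[j]))))
        if updated == centers:
            break
        centers = updated
    return centers, assign(coreset, centers)

def weighted_silhouette(coreset, labels, k):
    """Average silhouette, counting a point of weight w as w coincident copies."""
    totals = [0.0] * k
    for (_, w), l in zip(coreset, labels):
        totals[l] += w
    nonempty = [j for j in range(k) if totals[j] > 0]
    if len(nonempty) < 2:
        return 0.0
    score = 0.0
    for (p, w), l in zip(coreset, labels):
        if totals[l] <= 1:
            continue  # a cluster holding a single point contributes 0
        dist_sum = [0.0] * k
        for (q, v), m in zip(coreset, labels):
            dist_sum[m] += v * dist(p, q)
        a = dist_sum[l] / (totals[l] - 1)
        b = min(dist_sum[j] / totals[j] for j in nonempty if j != l)
        if max(a, b) > 0:
            score += w * (b - a) / max(a, b)
    return score / sum(totals)

def choose_k(points, ks, cell):
    """Build the coreset once, reuse it for every k, return (best k, all scores)."""
    coreset = grid_coreset(points, cell)
    scores = {}
    for k in ks:
        _, labels = weighted_kmeans(coreset, k)
        scores[k] = weighted_silhouette(coreset, labels, k)
    return max(scores, key=scores.get), scores
```

A call such as `choose_k(points, range(2, 6), cell=1.0)` clusters only the coreset, so the cost of trying several values of k depends on the coreset size rather than on the size of the original point set. The grid cell width is a free parameter of this sketch and trades accuracy against speed.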

Gereon Frahling, Christian Sohler

Algorithm | Average Silhouette Coefficient | COMPGEOM 2006 | K-means Clustering Algorithm

Added	13 Jun 2010
Updated	13 Jun 2010
Type	Conference
Year	2006
Where	COMPGEOM
Authors	Gereon Frahling, Christian Sohler
