Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

10

SDM
2010
SIAM

favoriteEmaildiscussreport

181views Data Mining» more SDM 2010»

Making k-means Even Faster

13 years 6 months ago

Making k-means Even Faster

Download cs.baylor.edu

The k-means algorithm is widely used for clustering, compressing, and summarizing vector data. In this paper, we propose a new acceleration for exact k-means that gives the same answer, but is much faster in practice. Like Elkan's accelerated algorithm [8], our algorithm avoids distance computations using distance bounds and the triangle inequality. Our algorithm uses one novel lower bound for point-center distances, which allows it to eliminate the innermost k-means loop 80% of the time or more in our experiments. On datasets of low and medium dimension (e.g. up to 50 dimensions), our algorithm is much faster than other methods, including methods based on low-dimensional indexes, such as k-d trees. Other advantages are that it is very simple to implement and it has a very small memory overhead, much smaller than other accelerated algorithms.

Greg Hamerly

Real-time Traffic

Accelerated Algorithms | Algorithm | Data Mining | K-means | SDM 2010 |

claim paper

Related Content

» Making fast buffer insertion even faster via approximation techniques

» Faster Joins Self Joins and MultiWay Joins Using Join Indices

» Faster topk document retrieval using blockmax indexes

» Reaching fast code faster using modeling for efficient software thread integration on a VL...

» Faster approximation algorithms for the minimum latency problem

» A Faster Better Approximation Algorithm for the Minimum Latency Problem

» Iterative Incremental Clustering of Time Series

» FREAK Fast Retina Keypoint

» The Rio File Cache Surviving Operating System Crashes

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	SDM
Authors	Greg Hamerly

Comments (0)