Efficiently clustering transactional data with weighted coverage density

13 years 6 months ago

Download www.cs.wright.edu

In this paper, we propose a fast, memory-efficient, and scalable clustering algorithm for analyzing transactional data. Our approach has three unique features. First, we use the concept of Weighted Coverage Density as a categorical similarity measure for efficient clustering of transactional datasets. The concept of weighted coverage density is intuitive and allows the weight of each item in a cluster to be changed dynamically according to the occurrences of items. Second, we develop two transactional data clustering specific evaluation metrics based on the concept of large transactional items and the coverage density respectively. Third, we implement the weighted coverage density clustering algorithm and the two clustering validation metrics using a fully automated transactional clustering framework, called SCALE (Sampling, Clustering structure Assessment, cLustering and domain-specific Evaluation). The SCALE framework is designed to combine the weighted coverage density measure for ...

Hua Yan, Keke Chen, Ling Liu

Real-time Traffic

CIKM 2006 | Clustering Algorithms | Information Management | Scalable Clustering Algorithm | Weighted Coverage Density |

claim paper

Added	13 Oct 2010
Updated	13 Oct 2010
Type	Conference
Year	2006
Where	CIKM
Authors	Hua Yan, Keke Chen, Ling Liu

Sciweavers

Efficiently clustering transactional data with weighted coverage density

CIKM 2006 | Clustering Algorithms | Information Management | Scalable Clustering Algorithm | Weighted Coverage Density |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers