Sciweavers

ICDT
2009
ACM

Analysis of sampling techniques for association rule mining

14 years 5 months ago
Analysis of sampling techniques for association rule mining
In this paper, we present a comprehensive theoretical analysis of the sampling technique for the association rule mining problem. Most of the previous works have concentrated only on the empirical evaluation of the effectiveness of sampling for the step of finding frequent itemsets. To the best of our knowledge, a theoretical framework to analyze the quality of the solutions obtained by sampling has not been studied. Our contributions are two-fold. First, we present the notions of -close frequent itemset mining and -close association rule mining that help assess the quality of the solutions obtained by sampling. Secondly, we show that both the frequent items mining and association rule mining problems can be solved satisfactorily with a sample size that is independent of both the number of transactions size and the number of items. Let be the required support, the closeness parameter, and 1/h the desired bound on the probability of failure. We show that the sampling based analysis su...
Venkatesan T. Chakaravarthy, Vinayaka Pandit, Yogi
Added 21 Nov 2009
Updated 21 Nov 2009
Type Conference
Year 2009
Where ICDT
Authors Venkatesan T. Chakaravarthy, Vinayaka Pandit, Yogish Sabharwal
Comments (0)