Sciweavers

PAKDD
2010
ACM

Efficient Pattern Mining of Uncertain Data with Sampling

13 years 6 months ago
Efficient Pattern Mining of Uncertain Data with Sampling
Mining frequent itemsets from transactional datasets is a well known problem with good algorithmic solutions. In the case of uncertain data, however, several new techniques have been proposed. Unfortunately, these proposals often suffer when a lot of items occur with many different probabilities. Here we propose an approach based on sampling by instantiating "possible worlds" of the uncertain data, on which we subsequently run optimized frequent itemset mining algorithms. As such we gain efficiency at a surprisingly low loss in accuracy. These is confirmed by a statistical and an empirical evaluation on real and synthetic data.
Toon Calders, Calin Garboni, Bart Goethals
Added 14 Oct 2010
Updated 14 Oct 2010
Type Conference
Year 2010
Where PAKDD
Authors Toon Calders, Calin Garboni, Bart Goethals
Comments (0)