Sciweavers

IEEEHPCS
2010

Discovering closed frequent itemsets on multicore: Parallelizing computations and optimizing memory accesses

13 years 3 months ago
Discovering closed frequent itemsets on multicore: Parallelizing computations and optimizing memory accesses
The problem of closed frequent itemset discovery is a fundamental problem of data mining, having applications in numerous domains. It is thus very important to have efficient parallel algorithms to solve this probem, capable of efficiently harnessing the power of multicore processors that exists in our computers (notebooks as well as desktops). In this paper we present PLCMQS, a parallel algorithm based on the LCM algorithm, recognized as the most efficient algorithm for sequential discovery of closed frequent itemsets. We also present a simple yet powerfull parallelism interface based on the concept of Tuple Space, which allows an efficient dynamic sharing of the work. Thanks to a detailed experimental study, we show that PLCMQS is the only algorithm which is generic enough to compute efficiently closed frequent itemsets both on sparse and dense databases, thus improving the state of the art.
Benjamin Négrevergne, Alexandre Termier, Je
Added 26 Jan 2011
Updated 26 Jan 2011
Type Journal
Year 2010
Where IEEEHPCS
Authors Benjamin Négrevergne, Alexandre Termier, Jean-François Méhaut, Takeaki Uno
Comments (0)