Sciweavers

PAKDD
2004
ACM

Separating Structure from Interestingness

13 years 9 months ago
Separating Structure from Interestingness
Condensed representations of pattern collections have been recognized to be important building blocks of inductive databases, a promising theoretical framework for data mining, and recently they have been studied actively. However, there has not been much research on how condensed representations should actually be represented. In this paper we propose a general approach to build condensed representations of pattern collections. The approach is based on separating the structure of the pattern collection from the interestingness values of the patterns. We study also the concrete case of representing the frequent sets and their (approximate) frequencies following this approach: we discuss the trade-offs in representing the frequent sets by the maximal frequent sets, the minimal infrequent sets and their combinations, and investigate the problem approximating the frequencies from samples by giving new upper bounds on sample complexity based on frequent closed sets and describing how conv...
Taneli Mielikäinen
Added 02 Jul 2010
Updated 02 Jul 2010
Type Conference
Year 2004
Where PAKDD
Authors Taneli Mielikäinen
Comments (0)