

INAP
2001
Springer


Discovering Frequent Itemsets in the Presence of Highly Frequent Items



Download www.informatics.indiana.edu

This paper presents new techniques for focusing the discovery of frequent itemsets within large, dense datasets containing highly frequent items. The existence of highly frequent items adds significantly to the cost of computing the complete set of frequent itemsets. Our approach allows for the exclusion of such items during the candidate generation phase of the Apriori algorithm. Afterwards, the highly frequent items can be reintroduced, via an inferencing framework, providing for a capability to generate frequent itemsets without counting their frequency. We demonstrate the use of these new techniques within the well-studied framework of the Apriori algorithm. Furthermore, we provide empirical results using our techniques on both synthetic and real datasets - both relevant since the real datasets exhibit statistical characteristics different from the probabilistic assumptions behind the synthetic data. The source we used for real data was the U.S. Census.
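The two-stage idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `apriori` and `infer_with_item`, the toy transactions, and the threshold are all hypothetical. In particular, the abstract does not specify the inferencing framework, so the sketch assumes a standard inclusion-exclusion lower bound, count(X ∪ {h}) ≥ count(X) + count(h) − N, where N is the number of transactions. The bound is sound but conservative, so it can miss itemsets that are in fact frequent.

```python
from itertools import combinations


def apriori(transactions, min_count, excluded=frozenset()):
    """Plain Apriori over the items not in `excluded`; returns {itemset: count}."""
    # Project each transaction onto the retained items.
    rows = [frozenset(t) - excluded for t in transactions]
    counts = {}
    for row in rows:
        for item in row:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    level = {s: c for s, c in counts.items() if c >= min_count}
    frequent = dict(level)
    k = 2
    while level:
        # Join step: unions of frequent (k-1)-itemsets that have size k.
        candidates = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-subset must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        }
        level = {}
        for c in candidates:
            n = sum(1 for row in rows if c <= row)
            if n >= min_count:
                level[c] = n
        frequent.update(level)
        k += 1
    return frequent


def infer_with_item(frequent, item, item_count, n_rows, min_count):
    """Assumed inference step (not taken from the paper): keep X | {item}
    when the lower bound count(X) + count(item) - n_rows reaches min_count."""
    inferred = {}
    for itemset, count in frequent.items():
        lower = count + item_count - n_rows
        if lower >= min_count:
            inferred[itemset | {item}] = lower  # value is a lower bound
    return inferred


# Hypothetical toy data: "h" plays the role of the highly frequent item.
transactions = [
    {"a", "b", "h"}, {"a", "b", "h"}, {"a", "b", "h"},
    {"a", "c", "h"}, {"b", "c", "h"}, {"a", "b"},
]
h_count = sum(1 for t in transactions if "h" in t)
frequent = apriori(transactions, min_count=2, excluded=frozenset({"h"}))
inferred = infer_with_item(frequent, "h", h_count, len(transactions), min_count=2)
```

On this toy data, {a, b, h} is reported as frequent without scanning for it, while {c, h}, which does occur twice, is not inferred because the bound is too weak for it. Itemsets that are not inferred would still need to be counted directly to decide whether they are frequent.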

Dennis P. Groth, Edward L. Robertson


Discovery of Frequent Itemsets | Frequent Items | Frequent Itemsets | INAP 2001 | Information Management


Related Content

» Efficient discovery of error-tolerant frequent itemsets in high dimensions

» Algorithms for Discovery of Frequent Superset Rather than Frequent Subset

» Quantitative evaluation of approximate frequent pattern mining algorithms

» A Tree-based Approach for Efficiently Mining Approximate Frequent Itemsets

» Fast Frequent Itemset Mining using Compressed Data Representation

» Two-Phase Algorithms for a Novel Utility-Frequent Mining Model

» Mining Multi-Level Frequent Itemsets under Constraints

» An improved multiple minimum support based approach to mine rare association rules

» A Bottom-Up Projection Based Algorithm for Mining High Utility Itemsets

Post Info

Added	30 Jul 2010
Updated	30 Jul 2010
Type	Conference
Year	2001
Where	INAP
Authors	Dennis P. Groth, Edward L. Robertson
