Sciweavers

553 search results - page 45 / 111
» A Compress-Based Association Mining Algorithm for Large Data...
Sort
View
SDM
2011
SIAM
241views Data Mining» more  SDM 2011»
14 years 21 days ago
A Fast Algorithm for Sparse PCA and a New Sparsity Control Criteria
Sparse principal component analysis (PCA) imposes extra constraints or penalty terms to the standard PCA to achieve sparsity. In this paper, we first introduce an efficient algor...
Yunlong He, Renato Monteiro, Haesun Park
AUSDM
2007
Springer
101views Data Mining» more  AUSDM 2007»
15 years 1 months ago
Exploratory Multilevel Hot Spot Analysis: Australian Taxation Office Case Study
Population based real-life datasets often contain smaller clusters of unusual sub-populations. While these clusters, called `hot spots', are small and sparse, they are usuall...
Denny, Graham J. Williams, Peter Christen
ICDM
2002
IEEE
159views Data Mining» more  ICDM 2002»
15 years 2 months ago
O-Cluster: Scalable Clustering of Large High Dimensional Data Sets
Clustering large data sets of high dimensionality has always been a serious challenge for clustering algorithms. Many recently developed clustering algorithms have attempted to ad...
Boriana L. Milenova, Marcos M. Campos
ICCS
2009
Springer
15 years 4 months ago
Frequent Itemset Mining for Clustering Near Duplicate Web Documents
A vast amount of documents in the Web have duplicates, which is a challenge for developing efficient methods that would compute clusters of similar documents. In this paper we use ...
Dmitry I. Ignatov, Sergei O. Kuznetsov
DCC
2000
IEEE
15 years 2 months ago
Summary Structures for Frequency Queries on Large Transaction Sets
As large-scale databases become commonplace, there has been signi cant interest in mining them for commercial purposes. One of the basic tasks that underlies many of these mining ...
Dow-Yung Yang, Akshay Johar, Ananth Grama, Wojciec...