Sciweavers

346 search results - page 20 / 70
» Scalable Parallel Clustering for Data Mining on Multicompute...
Sort
View
IPPS
2008
IEEE
15 years 4 months ago
Parallel mining of closed quasi-cliques
Graph structure can model the relationships among a set of objects. Mining quasi-clique patterns from large dense graph data makes sense with respect to both statistic and applica...
Yuzhou Zhang, Jianyong Wang, Zhiping Zeng, Lizhu Z...
DBISP2P
2008
Springer
124views Database» more  DBISP2P 2008»
14 years 11 months ago
Exploiting Distribution Skew for Scalable P2P Text Clustering
K-Means clustering is widely used in information retrieval and data mining. Distributed K-Means variants have already been proposed, but none of the past algorithms scales to large...
Odysseas Papapetrou, Wolf Siberski, Fabian Leitrit...
CLUSTER
2009
IEEE
15 years 2 months ago
Numerically stable, single-pass, parallel statistics algorithms
—Statistical analysis is widely used for countless scientific applications in order to analyze and infer meaning from data. A key challenge of any statistical analysis package a...
Janine Bennett, R. Grout, Philippe P. Pébay...
SIGKDD
2000
95views more  SIGKDD 2000»
14 years 9 months ago
Scalability for Clustering Algorithms Revisited
This paper presents a simple new algorithm that performs k-means clustering in one scan of a dataset, while using a bu er for points from the dataset of xed size. Experiments show...
Fredrik Farnstrom, James Lewis, Charles Elkan
105
Voted
ICDE
2012
IEEE
238views Database» more  ICDE 2012»
13 years 2 days ago
Mining Knowledge from Data: An Information Network Analysis Approach
Abstract—Most objects and data in the real world are interconnected, forming complex, heterogeneous but often semistructured information networks. However, many database research...
Jiawei Han, Yizhou Sun, Xifeng Yan, Philip S. Yu