Sciweavers

IDEAS
2006
IEEE
PBIRCH: A Scalable Parallel Clustering algorithm for Incremental Data
We present a parallel version of BIRCH with the objective of enhancing the scalability without compromising on the quality of clustering. The incoming data is distributed in a cyc...
Ashwani Garg, Ashish Mangla, Neelima Gupta, Vasudh...
CLUSTER
2009
IEEE
Numerically stable, single-pass, parallel statistics algorithms
Statistical analysis is widely used for countless scientific applications in order to analyze and infer meaning from data. A key challenge of any statistical analysis package a...
Janine Bennett, R. Grout, Philippe P. Pébay...
DATAMINE
1999
A Scalable Parallel Algorithm for Self-Organizing Maps with Applications to Sparse Data Mining Problems
We describe a scalable parallel implementation of the self-organizing map (SOM) suitable for data mining applications involving clustering or segmentation against large da...
Richard D. Lawrence, George S. Almasi, Holly E. Ru...
SIGMOD
2011
ACM
A platform for scalable one-pass analytics using MapReduce
Today’s one-pass analytics applications tend to be data-intensive in nature and require the ability to process high volumes of data efficiently. MapReduce is a popular programm...
Boduo Li, Edward Mazur, Yanlei Diao, Andrew McGreg...
ICPP
2000
IEEE
A Scalable Parallel Subspace Clustering Algorithm for Massive Data Sets
Clustering is a data mining problem which finds dense regions in a sparse multi-dimensional data set. The attribute values and ranges of these regions characterize the clusters. ...
Harsha S. Nagesh, Sanjay Goil, Alok N. Choudhary