Sciweavers

IPPS
2006
IEEE
15 years 3 months ago
Design and analysis of a multi-dimensional data sampling service for large scale data analysis applications
Sampling is a widely used technique to increase efficiency in database and data mining applications operating on large datasets. In this paper we present a scalable sampling imple...
Xi Zhang, Tahsin M. Kurç, Joel H. Saltz, Sr...
IDEAS
2006
IEEE
15 years 3 months ago
PBIRCH: A Scalable Parallel Clustering algorithm for Incremental Data
We present a parallel version of BIRCH with the objective of enhancing the scalability without compromising on the quality of clustering. The incoming data is distributed in a cyc...
Ashwani Garg, Ashish Mangla, Neelima Gupta, Vasudh...
HPCN
1999
Springer
15 years 1 month ago
JAVA as a Basis for Parallel Data Mining in Workstation Clusters
Matthias Gimbel, Michael Philippsen, Bernhard Haum...
KDD
2007
ACM
15 years 10 months ago
A scalable modular convex solver for regularized risk minimization
A wide variety of machine learning problems can be described as minimizing a regularized risk functional, with different algorithms using different notions of risk and different r...
Choon Hui Teo, Alex J. Smola, S. V. N. Vishwanatha...
ICDM
2003
IEEE
15 years 2 months ago
Scalable Model-based Clustering by Working on Data Summaries
The scalability problem in data mining involves the development of methods for handling large databases with limited computational resources. In this paper, we present a two-phase...
Huidong Jin, Man Leung Wong, Kwong-Sak Leung