Sciweavers

346 search results - page 41 / 70
» Scalable Parallel Clustering for Data Mining on Multicompute...
SDM
2009
SIAM
15 years 6 months ago
CORE: Nonparametric Clustering of Large Numeric Databases.
Current clustering techniques are able to identify arbitrarily shaped clusters in the presence of noise, but depend on carefully chosen model parameters. The choice of model param...
Andrej Taliun, Arturas Mazeika, Michael H. Bö...
ICDCS
1999
IEEE
15 years 2 months ago
The Swarm Scalable Storage System
Swarm is a storage system that provides scalable, reliable, and cost-effective data storage. Swarm is based on storage servers, rather than file servers; the storage servers are o...
John H. Hartman, Ian Murdock, Tammo Spalink
FAST
2010
15 years 5 hours ago
Panache: A Parallel File System Cache for Global File Access
Cloud computing promises large-scale and seamless access to vast quantities of data across the globe. Applications will demand the reliability, consistency, and performance of a t...
Marc Eshel, Roger L. Haskin, Dean Hildebrand, Mano...
NCA
2006
IEEE
15 years 3 months ago
Parallelization of Geant4 Using TOP-C and Marshalgen
Geant4 is a very large, highly accurate toolkit for Monte Carlo simulation of particle-matter interaction. It has been applied to high-energy physics, cosmic ray modeling, radiati...
Gene Cooperman, Viet Ha Nguyen, Igor Malioutov
SDM
2012
SIAM
13 years 2 days ago
A Distributed Kernel Summation Framework for General-Dimension Machine Learning
Kernel summations are a ubiquitous key computational bottleneck in many data analysis methods. In this paper, we attempt to marry, for the first time, the best relevant technique...
Dongryeol Lee, Richard W. Vuduc, Alexander G. Gray