Sciweavers

5209 search results - page 124 / 1042
» Multiobjective Data Clustering
Sort
View
165
Voted
EDBT
2012
ACM
306views Database» more  EDBT 2012»
13 years 5 months ago
Clydesdale: structured data processing on MapReduce
MapReduce has emerged as a promising architecture for large scale data analytics on commodity clusters. The rapid adoption of Hive, a SQL-like data processing language on Hadoop (...
Tim Kaldewey, Eugene J. Shekita, Sandeep Tata
169
Voted
KDD
2003
ACM
191views Data Mining» more  KDD 2003»
16 years 3 months ago
Assessment and pruning of hierarchical model based clustering
The goal of clustering is to identify distinct groups in a dataset. The basic idea of model-based clustering is to approximate the data density by a mixture model, typically a mix...
Jeremy Tantrum, Alejandro Murua, Werner Stuetzle
138
Voted
CLUSTER
2008
IEEE
15 years 10 months ago
Enabling lock-free concurrent fine-grain access to massive distributed data: Application to supernovae detection
—We consider the problem of efficiently managing massive data in a large-scale distributed environment. We consider data strings of size in the order of Terabytes, shared and ac...
Bogdan Nicolae, Gabriel Antoniu, Luc Bougé
137
Voted
CVPR
2008
IEEE
16 years 5 months ago
Robust estimation of gaussian mixtures from noisy input data
We propose a variational bayes approach to the problem of robust estimation of gaussian mixtures from noisy input data. The proposed algorithm explicitly takes into account the un...
Shaobo Hou, Aphrodite Galata
133
Voted
DMKD
1997
ACM
198views Data Mining» more  DMKD 1997»
15 years 7 months ago
Clustering Based On Association Rule Hypergraphs
Clustering in data mining is a discovery process that groups a set of data such that the intracluster similarity is maximized and the intercluster similarity is minimized. These d...
Eui-Hong Han, George Karypis, Vipin Kumar, Bamshad...