Sciweavers

5209 search results - page 159 / 1042
» Multiobjective Data Clustering
Sort
View
167
Voted
OSDI
2004
ACM
16 years 3 months ago
MapReduce: Simplified Data Processing on Large Clusters
MapReduce is a programming model and an associated implementation for processing and generating large data sets. Users specify a map function that processes a key/value pair to ge...
Jeffrey Dean, Sanjay Ghemawat
140
Voted
IDEAL
2004
Springer
15 years 9 months ago
Visualisation of Distributions and Clusters Using ViSOMs on Gene Expression Data
Microarray datasets are often too large to visualise due to the high dimensionality. The self-organising map has been found useful to analyse massive complex datasets. It can be us...
Swapna Sarvesvaran, Hujun Yin
118
Voted
KES
2005
Springer
15 years 9 months ago
OntoExtractor: A Fuzzy-Based Approach in Clustering Semi-structured Data Sources and Metadata Generation
This paper describes a theoretical approach on data mining, information classifying and a global overview of our OntoExtractor application, concerning the analysis of incoming data...
Zhan Cui, Ernesto Damiani, Marcello Leida, Marco V...
222
Voted
ICDT
2009
ACM
148views Database» more  ICDT 2009»
16 years 4 months ago
Tight results for clustering and summarizing data streams
In this paper we investigate algorithms and lower bounds for summarization problems over a single pass data stream. In particular we focus on histogram construction and K-center c...
Sudipto Guha
160
Voted
ISVC
2009
Springer
15 years 10 months ago
Parallel 3D Image Segmentation of Large Data Sets on a GPU Cluster
In this paper, we propose an inherent parallel scheme for 3D image segmentation of large volume data on a GPU cluster. This method originates from an extended Lattice Boltzmann Mod...
Aaron Hagan, Ye Zhao