Sciweavers

1227 search results - page 176 / 246
» Approximate Kernel Clustering
OSDI
2008
ACM
15 years 10 months ago
Improving MapReduce Performance in Heterogeneous Environments
MapReduce is emerging as an important programming model for large-scale data-parallel applications such as web indexing, data mining, and scientific simulation. Hadoop is an open-...
Matei Zaharia, Andy Konwinski, Anthony D. Joseph, ...
CIDM
2007
IEEE
15 years 1 months ago
An Efficient Distance Calculation Method for Uncertain Objects
Recently the academic communities have paid more attention to the queries and mining on uncertain data. In the tasks such as clustering or nearest-neighbor queries, expected distan...
Lurong Xiao, Edward Hung
CLA
2006
14 years 11 months ago
Direct Factorization by Similarity of Fuzzy Concept Lattices by Factorization of Input Data
The paper presents additional results on factorization by similarity of fuzzy concept lattices. A fuzzy concept lattice is a hierarchically ordered collection of clusters extracted...
Radim Belohlávek, Jan Outrata, Vilém...
SISAP
2009
IEEE
15 years 4 months ago
Searching by Similarity and Classifying Images on a Very Large Scale
In the demonstration we will show a system for searching by similarity and automatically classifying images in a very large dataset. The demonstrated techniques are based on the...
Giuseppe Amato, Pasquale Savino
PODS
2012
ACM
13 years 10 days ago
Mergeable summaries
We study the mergeability of data summaries. Informally speaking, mergeability requires that, given two summaries on two data sets, there is a way to merge the two summaries into ...
Pankaj K. Agarwal, Graham Cormode, Zengfeng Huang,...