Sciweavers

346 search results - page 16 / 70
» Scalable Parallel Clustering for Data Mining on Multicompute...
Sort
View
SDM
2007
SIAM
112views Data Mining» more  SDM 2007»
14 years 11 months ago
PoClustering: Lossless Clustering of Dissimilarity Data
Given a set of objects V with a dissimilarity measure between pairs of objects in V , a PoCluster is a collection of sets P ⊂ powerset(V ) partially ordered by the ⊂ relation ...
Jinze Liu, Qi Zhang, Wei Wang 0010, Leonard McMill...
CIKM
2009
Springer
15 years 4 months ago
SPIDER: a system for scalable, parallel / distributed evaluation of large-scale RDF data
RDF is a data model for representing labeled directed graphs, and it is used as an important building block of semantic web. Due to its flexibility and applicability, RDF has bee...
Hyunsik Choi, Jihoon Son, YongHyun Cho, Min Kyoung...
HPCC
2005
Springer
15 years 3 months ago
A Coarse Grained Parallel Algorithm for Closest Larger Ancestors in Trees with Applications to Single Link Clustering
Hierarchical clustering methods are important in many data mining and pattern recognition tasks. In this paper we present an efficient coarse grained parallel algorithm for Single...
Albert Chan, Chunmei Gao, Andrew Rau-Chaplin
PAKDD
2010
ACM
158views Data Mining» more  PAKDD 2010»
15 years 2 months ago
Integrative Parameter-Free Clustering of Data with Mixed Type Attributes
Abstract. Integrative mining of heterogeneous data is one of the major challenges for data mining in the next decade. We address the problem of integrative clustering of data with ...
Christian Böhm, Sebastian Goebl, Annahita Osw...
DATAMINE
2006
89views more  DATAMINE 2006»
14 years 9 months ago
Scalable Clustering Algorithms with Balancing Constraints
Clustering methods for data-mining problems must be extremely scalable. In addition, several data mining applications demand that the clusters obtained be balanced, i.e., be of ap...
Arindam Banerjee, Joydeep Ghosh