Sciweavers

17390 search results - page 106 / 3478
» Distributed Data Clustering
Sort
View
139
Voted
KDD
2007
ACM
276views Data Mining» more  KDD 2007»
16 years 2 months ago
Nonlinear adaptive distance metric learning for clustering
A good distance metric is crucial for many data mining tasks. To learn a metric in the unsupervised setting, most metric learning algorithms project observed data to a lowdimensio...
Jianhui Chen, Zheng Zhao, Jieping Ye, Huan Liu
CLUSTER
2006
IEEE
15 years 5 months ago
HPC Cluster Readiness of Xen and User Mode Linux
This paper examines the suitability of different virtualization techniques in a high performance cluster environment. A survey of virtualization techniques is presented. Two repre...
Wesley Emeneker, Dan Stanzione
SIGMOD
2009
ACM
140views Database» more  SIGMOD 2009»
16 years 1 months ago
Distributed data-parallel computing using a high-level programming language
The Dryad and DryadLINQ systems offer a new programming model for large scale data-parallel computing. They generalize previous execution environments such as SQL and MapReduce in...
Michael Isard, Yuan Yu
AAAI
2008
15 years 4 months ago
Clustering via Random Walk Hitting Time on Directed Graphs
In this paper, we present a general data clustering algorithm which is based on the asymmetric pairwise measure of Markov random walk hitting time on directed graphs. Unlike tradi...
Mo Chen, Jianzhuang Liu, Xiaoou Tang
KDD
2002
ACM
170views Data Mining» more  KDD 2002»
16 years 2 months ago
Enhanced word clustering for hierarchical text classification
In this paper we propose a new information-theoretic divisive algorithm for word clustering applied to text classification. In previous work, such "distributional clustering&...
Inderjit S. Dhillon, Subramanyam Mallela, Rahul Ku...