Sciweavers

969 search results - page 69 / 194
» Clustering performance data efficiently at massive scales
Sort
View
99
Voted
BIOINFORMATICS
2005
71views more  BIOINFORMATICS 2005»
15 years 21 days ago
Semi-supervised protein classification using cluster kernels
A key issue in supervised protein classification is the representation of input sequences of amino acids. Recent work using string kernels for protein data has achieved state-of-t...
Jason Weston, Christina S. Leslie, Eugene Ie, Deng...
224
Voted
ICDE
2006
IEEE
215views Database» more  ICDE 2006»
16 years 2 months ago
cgmOLAP: Efficient Parallel Generation and Querying of Terabyte Size ROLAP Data Cubes
In this demo we present the cgmOLAP server, the first fully functional parallel OLAP system able to build data cubes at a rate of more than 1 Terabyte per hour. cgmOLAP incorporat...
Ying Chen, Andrew Rau-Chaplin, Frank K. H. A. Dehn...
117
Voted
EDBT
2008
ACM
167views Database» more  EDBT 2008»
16 years 27 days ago
HISSCLU: a hierarchical density-based method for semi-supervised clustering
In situations where class labels are known for a part of the objects, a cluster analysis respecting this information, i.e. semi-supervised clustering, can give insight into the cl...
Christian Böhm, Claudia Plant
102
Voted
ITSL
2008
15 years 2 months ago
An Empirical Comparison of NML Clustering Algorithms
Clustering can be defined as a data assignment problem where the goal is to partition the data into nonhierarchical groups of items. In our previous work, we suggested an informati...
Petri Kontkanen, Petri Myllymäki
99
Voted
CCGRID
2004
IEEE
15 years 4 months ago
Serving queries to multi-resolution datasets on disk-based storage clusters
This paper is concerned with efficient querying of very large multi-resolution datasets on storage and compute clusters. We present a suite of services that support storage, index...
Xi Zhang, Tony Pan, Ümit V. Çataly&uum...