Sciweavers

142 search results - page 3 / 29
» Clustering Large Datasets in Arbitrary Metric Spaces
JPDC
2007
13 years 5 months ago
Distributed computation of the knn graph for large high-dimensional point sets
High-dimensional problems arising from robot motion planning, biology, data mining, and geographic information systems often require the computation of k nearest neighbor (knn) gr...
Erion Plaku, Lydia E. Kavraki
CCGRID
2004
IEEE
13 years 9 months ago
Serving queries to multi-resolution datasets on disk-based storage clusters
This paper is concerned with efficient querying of very large multi-resolution datasets on storage and compute clusters. We present a suite of services that support storage, index...
Xi Zhang, Tony Pan, Ümit V. Çatalyü...
SISAP
2008
IEEE
Data Mining
13 years 11 months ago
An Empirical Evaluation of a Distributed Clustering-Based Index for Metric Space Databases
Similarity search has been proved suitable for searching in very large collections of unstructured data objects. We are interested in efficient parallel query processing under si...
Veronica Gil Costa, Mauricio Marín, Nora Re...
SODA
2008
ACM
Algorithms
13 years 6 months ago
Clustering for metric and non-metric distance measures
We study a generalization of the k-median problem with respect to an arbitrary dissimilarity measure D. Given a finite set P, our goal is to find a set C of size k such that the s...
Marcel R. Ackermann, Johannes Blömer, Christi...
JAIR
2010
13 years 3 months ago
Which Clustering Do You Want? Inducing Your Ideal Clustering with Minimal Feedback
While traditional research on text clustering has largely focused on grouping documents by topic, it is conceivable that a user may want to cluster documents along other dimension...
Sajib Dasgupta, Vincent Ng