Sciweavers

969 search results - page 113 / 194
» Clustering performance data efficiently at massive scales
Sort
View
MM
2004
ACM
151views Multimedia» more  MM 2004»
15 years 6 months ago
Affinity relation discovery in image database clustering and content-based retrieval
In this paper, we propose a unified framework, called Markov Model Mediator (MMM), to facilitate image database clustering and to improve the query performance. The structure of t...
Mei-Ling Shyu, Shu-Ching Chen, Min Chen, Chengcui ...
117
Voted
KDD
2009
ACM
198views Data Mining» more  KDD 2009»
16 years 1 months ago
Pervasive parallelism in data mining: dataflow solution to co-clustering large and sparse Netflix data
All Netflix Prize algorithms proposed so far are prohibitively costly for large-scale production systems. In this paper, we describe an efficient dataflow implementation of a coll...
Srivatsava Daruru, Nena M. Marin, Matt Walker, Joy...
107
Voted
BMCBI
2007
129views more  BMCBI 2007»
15 years 23 days ago
HoughFeature, a novel method for assessing drug effects in three-color cDNA microarray experiments
Background: Three-color microarray experiments can be performed to assess drug effects on the genomic scale. The methodology may be useful in shortening the cycle, reducing the co...
Hongya Zhao, Hong Yan
93
Voted
IPPS
2009
IEEE
15 years 7 months ago
A metascalable computing framework for large spatiotemporal-scale atomistic simulations
A metascalable (or “design once, scale on new architectures”) parallel computing framework has been developed for large spatiotemporal-scale atomistic simulations of materials...
Ken-ichi Nomura, Richard Seymour, Weiqiang Wang, H...
FAST
2010
15 years 3 months ago
Efficient Object Storage Journaling in a Distributed Parallel File System
Journaling is a widely used technique to increase file system robustness against metadata and/or data corruptions. While the overhead of journaling can be masked by the page cache...
Sarp Oral, Feiyi Wang, David Dillow, Galen M. Ship...