Sciweavers

» HaLoop: Efficient Iterative Data Processing on Large Cluster...
SDM
2004
SIAM
187 views, Data Mining
14 years 11 months ago
Minimum Sum-Squared Residue Co-Clustering of Gene Expression Data
Microarray experiments have been extensively used for simultaneously measuring DNA expression levels of thousands of genes in genome research. A key step in the analysis of gene e...
Hyuk Cho, Inderjit S. Dhillon, Yuqiang Guan, Suvri...
PVLDB
2010
132 views
14 years 7 months ago
CoDA: Interactive Cluster Based Concept Discovery
Large data resources are ubiquitous in science and business. For these domains, an intuitive view on the data is essential to fully exploit the hidden knowledge. Often, these data...
Stephan Günnemann, Ines Färber, Hardy Kr...
IPPS
2010
IEEE
14 years 7 months ago
Large-scale multi-dimensional document clustering on GPU clusters
Document clustering plays an important role in data mining systems. Recently, a flocking-based document clustering algorithm has been proposed to solve the problem through simulat...
Yongpeng Zhang, Frank Mueller, Xiaohui Cui, Thomas...
ICDE
2007
IEEE
129 views, Database
15 years 3 months ago
Ontology-driven Rule Generalization and Categorization for Market Data
Radio Frequency Identification (RFID) is an emerging technique that can significantly enhance supply chain processes and deliver customer service improvements. RFID provides use...
Dongwoo Won, Dennis McLeod
ICPR
2008
IEEE
15 years 3 months ago
Incremental clustering via nonnegative matrix factorization
Nonnegative matrix factorization (NMF) has been shown to be an efficient clustering tool. However, NMF's batch nature necessitates recomputation of whole basis set for new samples...
Serhat Selcuk Bucak, Bilge Günsel