Sciweavers

80 search results - page 11 / 16
» Partitioning datasets based on equalities among parameters
ICDM
2006
IEEE
15 years 3 months ago
High Quality, Efficient Hierarchical Document Clustering Using Closed Interesting Itemsets
High dimensionality remains a significant challenge for document clustering. Recent approaches used frequent itemsets and closed frequent itemsets to reduce dimensionality, and to...
Hassan H. Malik, John R. Kender
BMCBI
2010
14 years 9 months ago
Power and sample size estimation in microarray studies
Background: Before conducting a microarray experiment, one important issue that needs to be determined is the number of arrays required in order to have adequate power to identify...
Wei-Jiun Lin, Huey-miin Hsueh, James J. Chen
ICDM
2008
IEEE
15 years 4 months ago
Classifying High-Dimensional Text and Web Data Using Very Short Patterns
In this paper, we propose the "Democratic Classifier", a simple, democracy-inspired pattern-based classification algorithm that uses very short patterns for classificatio...
Hassan H. Malik, John R. Kender
RECOMB
2010
Springer
15 years 4 months ago
A Novel Abundance-Based Algorithm for Binning Metagenomic Sequences Using l-Tuples
Abstract. Metagenomics is the study of microbial communities sampled directly from their natural environment, without prior culturing. Among the computational tools recently develo...
Yu-Wei Wu, Yuzhen Ye
PVLDB
2010
14 years 8 months ago
Ten Thousand SQLs: Parallel Keyword Queries Computing
Keyword search in relational databases has been extensively studied. Given a relational database, a keyword query finds a set of interconnected tuple structures connected by fore...
Lu Qin, Jefferey Yu, Lijun Chang