Sciweavers

1950 search results - page 142 / 390
» Informative sampling for large unbalanced data sets
Sort
View
138
Voted
BMCBI
2007
94views more  BMCBI 2007»
15 years 3 months ago
A Hidden Markov Model to estimate population mixture and allelic copy-numbers in cancers using Affymetrix SNP arrays
Background: Affymetrix SNP arrays can interrogate thousands of SNPs at the same time. This allows us to look at the genomic content of cancer cells and to investigate the underlyi...
Philippe Lamy, Claus L. Andersen, Lars Dyrskjot, N...
157
Voted
INTENSIVE
2009
IEEE
15 years 10 months ago
A Service for Data-Intensive Computations on Virtual Clusters
Digital Preservation deals with the long-term storage, access, and maintenance of digital data objects. In order to prevent a loss of information, digital libraries and archives a...
Rainer Schmidt, Christian Sadilek, Ross King
129
Voted
GECCO
2009
Springer
188views Optimization» more  GECCO 2009»
15 years 7 months ago
Exploiting multiple classifier types with active learning
Many approaches to active learning involve training one classifier by periodically choosing new data points about which the classifier has the least confidence, but designing a co...
Zhenyu Lu, Josh Bongard
172
Voted
JIPS
2007
134views more  JIPS 2007»
15 years 3 months ago
An Efficient Functional Analysis Method for Micro-array Data Using Gene Ontology
: Microarray data includes tens of thousands of gene expressions simultaneously, so it can be effectively used in identifying the phenotypes of diseases. However, the retrieval of ...
Dong-wan Hong, Jong-keun Lee, Sung-soo Park, Sang-...
143
Voted
SDM
2007
SIAM
107views Data Mining» more  SDM 2007»
15 years 5 months ago
On Demand Phenotype Ranking through Subspace Clustering
High throughput biotechnologies have enabled scientists to collect a large number of genetic and phenotypic attributes for a large collection of samples. Computational methods are...
Xiang Zhang, Wei Wang 0010, Jun Huan