Sciweavers

1950 search results - page 68 / 390
» Informative sampling for large unbalanced data sets
BMCBI
2005
15 years 1 month ago
Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data
Background: A critical step in processing oligonucleotide microarray data is combining the information in multiple probes to produce a single number that best captures the express...
Kerby Shedden, Wei Chen, Rork Kuick, Debashis Ghos...
SDM
2009
SIAM
15 years 10 months ago
Low-Entropy Set Selection.
Most pattern discovery algorithms easily generate very large numbers of patterns, making the results impossible to understand and hard to use. Recently, the problem of instead sel...
Hannes Heikinheimo, Jilles Vreeken, Arno Siebes, H...
COLT
2004
Springer
15 years 6 months ago
An Inequality for Nearly Log-Concave Distributions with Applications to Learning
We prove that given a nearly log-concave distribution, in any partition of the space to two well separated sets, the measure of the points that do not belong to these s...
Constantine Caramanis, Shie Mannor
KDD
2007
ACM
16 years 1 months ago
Fast best-effort pattern matching in large attributed graphs
We focus on large graphs where nodes have attributes, such as a social network where the nodes are labelled with each person's job title. In such a setting, we want to find s...
Hanghang Tong, Christos Faloutsos, Brian Gallagher...
ICML
2007
IEEE
16 years 2 months ago
Beamforming using the relevance vector machine
Beamformers are spatial filters that pass source signals in particular focused locations while suppressing interference from elsewhere. The widely-used minimum variance adaptive b...
David P. Wipf, Srikantan S. Nagarajan