Search Sciweavers | Sciweavers

1950 search results - page 1 / 390

» Informative sampling for large unbalanced data sets

GECCO
2008
Springer

Informative sampling for large unbalanced data sets

15 years 7 months ago

Download www.cs.uvm.edu

Selective sampling is a form of active learning which can reduce the cost of training by only drawing informative data points into the training set. This selected training set is ...

Zhenyu Lu, Anand I. Rughani, Bruce I. Tranmer, Jos...

DMIN
2007

Cost-Sensitive Learning vs. Sampling: Which is Best for Handling Unbalanced Classes with Unequal Error Costs?

15 years 7 months ago

Download storm.cis.fordham.edu

The classifier built from a data set with a highly skewed class distribution generally predicts the more frequently occurring classes much more often than the infrequently occurr...

Gary M. Weiss, Kate McCarthy, Bibi Zabar

BMCBI
2006

The impact of sample imbalance on identifying differentially expressed genes

15 years 6 months ago

Download www.biomedcentral.com

Background: Recently several statistical methods have been proposed to identify genes with differential expression between two conditions. However, very few studies consider the p...

Kun Yang, Jianzhong Li, Hong Gao

KDD
2002
ACM

Learning to match and cluster large high-dimensional data sets for data integration

16 years 6 months ago

Download www.cs.cmu.edu

Part of the process of data integration is determining which sets of identifiers refer to the same real-world entities. In integrating databases found on the Web or obtained by us...

William W. Cohen, Jacob Richman

CORR
2010
Springer

Rules of Thumb for Information Acquisition from Large and Redundant Data

15 years 2 months ago

Download www.cs.washington.edu

We develop an abstract model of information acquisition from redundant data. We assume a random sampling process from data which contain information with bias and are interested in...

Wolfgang Gatterbauer
