Sciweavers

166 search results - page 1 / 34
» Classification on Data with Biased Class Distribution
Sort
View
ECML
2001
Springer
13 years 10 months ago
Classification on Data with Biased Class Distribution
Labeled data for classification could often be obtained by sampling that restricts or favors choice of certain classes. A classifier trained using such data will be biased, resulti...
Slobodan Vucetic, Zoran Obradovic
CVPR
2004
IEEE
14 years 7 months ago
Learning Classifiers from Imbalanced Data Based on Biased Minimax Probability Machine
We consider the problem of the binary classification on imbalanced data, in which nearly all the instances are labelled as one class, while far fewer instances are labelled as the...
Kaizhu Huang, Haiqin Yang, Irwin King, Michael R. ...
BMCBI
2006
110views more  BMCBI 2006»
13 years 5 months ago
Bias in error estimation when using cross-validation for model selection
Background: Cross-validation (CV) is an effective method for estimating the prediction error of a classifier. Some recent articles have proposed methods for optimizing classifiers...
Sudhir Varma, Richard Simon
SDM
2003
SIAM
156views Data Mining» more  SDM 2003»
13 years 6 months ago
Detection of Underrepresented Biological Sequences using Class-Conditional Distribution Models
A labeled sequence data set related to a certain biological property is often biased and, therefore, does not completely capture its diversity in nature. To reduce this sampling b...
Slobodan Vucetic, Dragoljub Pokrajac, Hongbo Xie, ...
KDD
2009
ACM
170views Data Mining» more  KDD 2009»
14 years 6 months ago
Genre-based decomposition of email class noise
Corruption of data by class-label noise is an important practical concern impacting many classification problems. Studies of data cleaning techniques often assume a uniform label ...
Aleksander Kolcz, Gordon V. Cormack