Sciweavers

AI
2001
Springer

A Case Study for Learning from Imbalanced Data Sets

13 years 9 months ago
A Case Study for Learning from Imbalanced Data Sets
We present our experience in applying a rule induction technique to an extremely imbalanced pharmaceutical data set. We focus on using a variety of performance measures to evaluate a number of rule quality measures. We also investigate whether simply changing the distribution skew in the training data can improve predictive performance. Finally, we propose a method for adjusting the learning algorithm for learning in an extremely imbalanced environment. Our experimental results show that this adjustment improves predictive performance for rule quality formulas in which rule coverage makes positive contributions to the rule quality value.
Aijun An, Nick Cercone, Xiangji Huang
Added 28 Jul 2010
Updated 28 Jul 2010
Type Conference
Year 2001
Where AI
Authors Aijun An, Nick Cercone, Xiangji Huang
Comments (0)