Sciweavers

SIGKDD
2008

Learning to improve area-under-FROC for imbalanced medical data classification using an ensemble method

13 years 3 months ago
Learning to improve area-under-FROC for imbalanced medical data classification using an ensemble method
This paper presents our solution for KDD Cup 2008 competition that aims at optimizing the area under ROC for breast cancer detection. We exploited weighted-based classification mechanism to improve the accuracy of patient classification (each patient is represented by a collection of data points). Final predictions for challenge 1 are generated by combining outputs from weighted SVM and AdaBoost; whereas we integrate SVM, AdaBoost, and GA to produce the results for challenge 2. We have also tried location-based classification and model adaptation to add the testing data into training. Our results outperform other participants given the same set of features, and was selected as the joint winner in KDD Cup 2008. Keywords Support Vector Machines, AdaBoost, ensemble method, breast cancer image classification, area under free response receiver operating curves (FROC).
Hung-Yi Lo, Chun-Min Chang, Tsung-Hsien Chiang, Ch
Added 15 Dec 2010
Updated 15 Dec 2010
Type Journal
Year 2008
Where SIGKDD
Authors Hung-Yi Lo, Chun-Min Chang, Tsung-Hsien Chiang, Cho-Yi Hsiao, Anta Huang, Tsung-Ting Kuo, Wei-Chi Lai, Ming-Han Yang, Jung-Jung Yeh, Chun-Chao Yen, Shou-De Lin
Comments (0)