Sciweavers

BIB
2011

Using cross-validation to evaluate predictive accuracy of survival risk classifiers based on high-dimensional data

12 years 8 months ago
Using cross-validation to evaluate predictive accuracy of survival risk classifiers based on high-dimensional data
Developments in whole genome biotechnology have stimulated statistical focus on prediction methods. We review here methodology for classifying patients into survival risk groups and for using cross-validation to evaluate such classifications. Measures of discrimination for survival risk models include separation of survival curves, time-dependent ROC curves and Harrell’s concordance index. For high-dimensional data applications, however, computing these measures as re-substitution statistics on the same data used for model development results in highly biased estimates. Most developments in methodology for survival risk modeling with high-dimensional data have utilized separate test data sets for model evaluation. Cross-validation has sometimes been used for optimization of tuning parameters. In many applications, however, the data available are too limited for effective division into training and test sets and consequently authors have often either reported re-substitution statisti...
Richard M. Simon, Jyothi Subramanian, Ming-Chung L
Added 24 Aug 2011
Updated 24 Aug 2011
Type Journal
Year 2011
Where BIB
Authors Richard M. Simon, Jyothi Subramanian, Ming-Chung Li, Supriya Menezes
Comments (0)