Using cross-validation to evaluate predictive accuracy of survival risk classifiers based on high-dimensional data

14 years 10 months ago

Download linus.nci.nih.gov

Developments in whole genome biotechnology have stimulated statistical focus on prediction methods. We review here methodology for classifying patients into survival risk groups and for using cross-validation to evaluate such classifications. Measures of discrimination for survival risk models include separation of survival curves, time-dependent ROC curves and Harrell’s concordance index. For high-dimensional data applications, however, computing these measures as re-substitution statistics on the same data used for model development results in highly biased estimates. Most developments in methodology for survival risk modeling with high-dimensional data have utilized separate test data sets for model evaluation. Cross-validation has sometimes been used for optimization of tuning parameters. In many applications, however, the data available are too limited for effective division into training and test sets and consequently authors have often either reported re-substitution statisti...

Richard M. Simon, Jyothi Subramanian, Ming-Chung L

Real-time Traffic

BIB 2011 | Internet Technology | ROC Curves | Survival Curves | Test Data Sets |

claim paper

» A multifactorial analysis of obesity as CVD risk factor Use of neural network based method...

» Classification of premalignant pancreatic cancer massspectrometry data using decision tree...

Post Info
More Details (n/a)

Added	24 Aug 2011
Updated	24 Aug 2011
Type	Journal
Year	2011
Where	BIB
Authors	Richard M. Simon, Jyothi Subramanian, Ming-Chung Li, Supriya Menezes

Comments (0)

Sciweavers

Using cross-validation to evaluate predictive accuracy of survival risk classifiers based on high-dimensional data

BIB 2011 | Internet Technology | ROC Curves | Survival Curves | Test Data Sets |

Explore & Download

Productivity Tools

Sciweavers