Prediction error estimation: a comparison of resampling methods

15 years 6 months ago

Download linus.nci.nih.gov

In genomic studies, thousands of features are collected on relatively few samples. One of the goals of these studies is to build classifiers to predict the outcome of future observations. There are three inherent steps to this process: feature selection, model selection, and prediction assessment. With a focus on prediction assessment, we compare several methods for estimating the 'true' prediction error of a prediction model in the presence of feature selection. For small studies where features are selected from thousands of candidates, the resubstitution and simple split-sample estimates are seriously biased. In these small samples, leave-one-out (LOOCV), 10-fold cross-validation (CV), and the .632+ bootstrap have the smallest bias for diagonal discriminant analysis, nearest neighbor, and classification trees. LOOCV and 10-fold CV have the smallest bias for linear discriminant analysis. Additionally, LOOCV, 5- and 10-fold CV, and the .632+ bootstrap have the lowest mean sq...

Annette M. Molinaro, Richard Simon, Ruth M. Pfeiff

Real-time Traffic

BIOINFORMATICS 2005 | Discriminant Analysis | Prediction Assessment | Smallest Bias |

claim paper

» Resampling methods for input modeling

» Rankinvariant resampling based estimation of false discovery rate for analysis of small sa...

» An Empirical Comparison of Pattern Recognition Neural Nets and Machine Learning Classifica...

» Estimating the Confidence of Statistical Model Based Shape Prediction

» Small Sample Inference for Generalization Error in Classification Using the CUD Bound

» Resampling strategy to improve the estimation of number of null hypotheses in FDR control ...

» Fast approximation of the bootstrap for model selection

» Sequential Noise Compensation by Sequential Monte Carlo Method

Post Info
More Details (n/a)

Added	15 Dec 2010
Updated	15 Dec 2010
Type	Journal
Year	2005
Where	BIOINFORMATICS
Authors	Annette M. Molinaro, Richard Simon, Ruth M. Pfeiffer

Comments (0)

Sciweavers

Prediction error estimation: a comparison of resampling methods

BIOINFORMATICS 2005 | Discriminant Analysis | Prediction Assessment | Smallest Bias |

Explore & Download

Productivity Tools

Sciweavers