Sciweavers

PRL
2006

Data complexity assessment in undersampled classification of high-dimensional biomedical data

14 years 14 days ago
Data complexity assessment in undersampled classification of high-dimensional biomedical data
Regularized linear classifiers have been successfully applied in undersampled, i.e. small sample size/high dimensionality biomedical classification problems. Additionally, a design of data complexity measures was proposed in order to assess the competence of a classifier in a particular context. Our work was motivated by the analysis of ill-posed regression problems by Elden and the interpretation of linear discriminant analysis as a mean square error classifier. Using Singular Value Decomposition analysis, we define a discriminatory power spectrum and show that it provides useful means of data complexity assessment for undersampled classification problems. In five real-life biomedical data sets of increasing difficulty we demonstrate how the data complexity of a classification problem can be related to the performance of regularized linear classifiers. We show that the concentration of the discriminatory power manifested in the discriminatory power spectrum is a deciding factor for t...
Richard Baumgartner, Ray L. Somorjai
Added 14 Dec 2010
Updated 14 Dec 2010
Type Journal
Year 2006
Where PRL
Authors Richard Baumgartner, Ray L. Somorjai
Comments (0)