Feature Selection for Classifying High-Dimensional Numerical Data

15 years 3 months ago

Download www.cse.buffalo.edu

Classifying high-dimensional numerical data is a very challenging problem. In high dimensional feature spaces, the performance of supervised learning methods suffer from the curse of dimensionality, which degrades both classification accuracy and efficiency. To address this issue, we present an efficient feature selection method to facilitate classifying high-dimensional numerical data. Our method employs balanced information gain to measure the contribution of each feature (for data classification); and it calculates feature correlation with a novel extension of balanced information gain. By integrating feature contribution and correlation, our feature selection approach uses a forward sequential selection algorithm to select uncorrelated features with large balanced information gain. Extensive experiments have been carried out on image and gene microarray datasets to demonstrate the effectiveness and robustness of the presented method.

Yimin Wu, Aidong Zhang

Real-time Traffic

Balanced Information Gain | Computer Vision | CVPR 2004 | Feature Contribution | Feature Correlation | Feature Selection Approach | Feature Selection Method |

claim paper

Post Info
More Details (n/a)

Added	12 Oct 2009
Updated	12 Oct 2009
Type	Conference
Year	2004
Where	CVPR
Authors	Yimin Wu, Aidong Zhang

Comments (0)

Sciweavers

Feature Selection for Classifying High-Dimensional Numerical Data

Balanced Information Gain | Computer Vision | CVPR 2004 | Feature Contribution | Feature Correlation | Feature Selection Approach | Feature Selection Method |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers