Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning

Algorithms for feature selection fall into two broad categories: wrappers that use the learning algorithm itself to evaluate the usefulness of features and filters that evaluate features according to heuristics based on general characteristics of the data. For application to large databases, filters have proven to be more practical than wrappers because they are much faster. However, most existing filter algorithms only work with discrete classification problems. This paper describes a fast, correlation-based filter algorithm that can be applied to continuous and discrete problems. The algorithm often outperforms the well-known ReliefF attribute estimator when used as a preprocessing step for naive Bayes, instance-based learning, decision trees, locally weighted regression, and model trees. It performs more feature selection than ReliefF does--reducing the data dimensionality by fifty percent in most cases. Also, decision and model trees built from the preprocessed data are often sign...
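The abstract does not spell out how the filter scores features. As a rough sketch, assume the subset merit heuristic usually associated with correlation-based feature selection: reward subsets whose features correlate strongly with the class and weakly with each other. The code also assumes numeric features and a numeric class, so that plain Pearson correlation applies. The names `cfs_merit` and `greedy_cfs` are hypothetical, and the greedy forward search is a simplification chosen for brevity, not something stated in the abstract.

```python
import numpy as np


def cfs_merit(X, y, subset):
    """Assumed merit heuristic: k * r_cf / sqrt(k + k * (k - 1) * r_ff).

    r_cf is the mean absolute feature-class correlation and r_ff the mean
    absolute feature-feature correlation within the subset.
    Assumes no constant columns (their correlation is undefined).
    """
    k = len(subset)
    if k == 0:
        return 0.0
    # Relevance: how strongly each selected feature tracks the class.
    r_cf = np.mean([abs(np.corrcoef(X[:, j], y)[0, 1]) for j in subset])
    # Redundancy: how strongly the selected features track each other.
    pairs = [(a, b) for i, a in enumerate(subset) for b in subset[i + 1:]]
    if pairs:
        r_ff = np.mean([abs(np.corrcoef(X[:, a], X[:, b])[0, 1]) for a, b in pairs])
    else:
        r_ff = 0.0
    return float(k * r_cf / np.sqrt(k + k * (k - 1) * r_ff))


def greedy_cfs(X, y):
    """Forward selection: add the feature that most improves the merit,
    and stop as soon as no addition improves it."""
    selected, best = [], 0.0
    remaining = list(range(X.shape[1]))
    while remaining:
        merit, j = max((cfs_merit(X, y, selected + [j]), j) for j in remaining)
        if merit <= best:
            break
        selected.append(j)
        remaining.remove(j)
        best = merit
    return selected, best
```

Calling `greedy_cfs(X, y)` on a 2-D NumPy array `X` and a 1-D target `y` returns the chosen column indices and their merit. Because redundancy appears in the denominator, a near-duplicate of an already selected feature tends to lower the score, which is one way a filter of this kind can discard a large share of the features without training a learner.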

Mark A. Hall

Correlation-based Filter Algorithm | Discrete Classification Problems | ICML 2000 | Machine Learning | Model Trees

» Sequence Discrimination Using Phase-Type Distributions

» Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE

» Kernel-imbedded Gaussian processes for disease classification using microarray gene express...

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2000
Where	ICML
Authors	Mark A. Hall

Sciweavers
