Sciweavers

KES
2004
Springer

A Comparison of Two Approaches to Data Mining from Imbalanced Data

14 years 4 months ago
A Comparison of Two Approaches to Data Mining from Imbalanced Data
Our objective is a comparison of two data mining approaches to dealing with imbalanced data sets. The first approach is based on saving the original rule set, induced by the LEM2 algorithm, and changing the rule strength for all rules for the smaller class (concept) during classification. In the second approach, rule induction was split: the rule set for the larger class was induced by LEM2, while the rule set for the smaller class was induced by EXPLORE, another data mining algorithm. Results of our experiments show that both approaches increase the sensitivity compared to the original LEM2. However, the difference in performance of both approaches is statistically insignificant. Thus the appropriate approach to dealing with imbalanced data sets should be selected individually for a specific data set.
Jerzy W. Grzymala-Busse, Jerzy Stefanowski, Szymon
Added 02 Jul 2010
Updated 02 Jul 2010
Type Conference
Year 2004
Where KES
Authors Jerzy W. Grzymala-Busse, Jerzy Stefanowski, Szymon Wilk
Comments (0)