Generating Diverse Ensembles to Counter the Problem of Class Imbalance

15 years 8 months ago

Download www.nd.edu

Abstract. One of the more challenging problems faced by the data mining community is that of imbalanced datasets. In imbalanced datasets one class (sometimes severely) outnumbers the other class, causing correct, and useful predictions to be difficult to achieve. In order to combat this, many techniques have been proposed, especially centered around sampling methods. In this paper we propose an ensemble framework that combines random subspaces with sampling to overcome the class imbalance problem. We then experimentally verify this technique on a wide variety of datasets. We conclude by analyzing the performance of the ensembles, and showing that, overall, our technique provides a significant improvement.

T. Ryan Hoens, Nitesh V. Chawla

Real-time Traffic

Class Imbalance Problem | Data Mining | Data Mining Community | Imbalanced Datasets | PAKDD 2010 |

claim paper

» Automatically countering imbalance and its empirical relationship to cost

» Ensemble of OneClass Classifiers for Network Intrusion Detection System

» Analysing the localisation sites of proteins through neural networks ensembles

» An Empirical Comparison of Hierarchical vs TwoLevel Approaches to Multiclass Problems

Post Info
More Details (n/a)

Added	14 Oct 2010
Updated	14 Oct 2010
Type	Conference
Year	2010
Where	PAKDD
Authors	T. Ryan Hoens, Nitesh V. Chawla

Comments (0)

Sciweavers

Generating Diverse Ensembles to Counter the Problem of Class Imbalance

Class Imbalance Problem | Data Mining | Data Mining Community | Imbalanced Datasets | PAKDD 2010 |

Explore & Download

Productivity Tools

Sciweavers