Learning with Class Skews and Small Disjuncts

15 years 8 months ago

Download www.icmc.usp.br

One of the main objectives of a Machine Learning – ML – system is to induce a classiﬁer that minimizes classiﬁcation errors. Two relevant topics in ML are the understanding of which domain characteristics and inducer limitations might cause an increase in misclassiﬁcation. In this sense, this work analyzes two important issues that might inﬂuence the performance of ML systems: class imbalance and errorprone small disjuncts. Our main objective is to investigate how these two important aspects are related to each other. Aiming at overcoming both problems we analyzed the behavior of two over-sampling methods we have proposed, namely Smote + Tomek links and Smote + ENN. Our results suggest that these methods are eﬀective for dealing with class imbalance and, in some cases, might help in ruling out some undesirable disjuncts. However, in some cases a simpler method, Random over-sampling, provides compatible results requiring less computational resources.

Ronaldo C. Prati, Gustavo E. A. P. A. Batista, Mar

Real-time Traffic

Artificial Intelligence | Class Imbalance | Errorprone Small Disjuncts | Main Objective | SBIA 2004 |

claim paper

» Classification of Skewed and Homogenous Document Corpora with ClassBased and CorpusBased K...

» A Lower Bound for Agnostically Learning Disjunctions

» Lower Bounds for Agnostic Learning via Approximate Rank

» Learning Switching Concepts

» Learning querydependent prefilters for scalable image retrieval

» Learning with Queries Corrupted by Classification Noise

Post Info
More Details (n/a)

Added	02 Jul 2010
Updated	02 Jul 2010
Type	Conference
Year	2004
Where	SBIA
Authors	Ronaldo C. Prati, Gustavo E. A. P. A. Batista, Maria Carolina Monard

Comments (0)

Sciweavers

Learning with Class Skews and Small Disjuncts

Artificial Intelligence | Class Imbalance | Errorprone Small Disjuncts | Main Objective | SBIA 2004 |

Explore & Download

Productivity Tools

Sciweavers