Cost-Based Sampling of Individual Instances

16 years 1 months ago

Download www.site.uottawa.ca

In many practical domains, misclassiﬁcation costs can diﬀer greatly and may be represented by class ratios, however, most learning algorithms struggle with skewed class distributions. The diﬃculty is attributed to designing classiﬁers to maximize the accuracy. Researchers call for using several techniques to address this problem including; undersampling the majority class, employing a probabilistic algorithm, and adjusting the classiﬁcation threshold. In this paper, we propose a general sampling approach that assigns weights to individual instances according to the cost function. This approach helps reveal the relationship between classiﬁcation performance and class ratios and allows the identiﬁcation of an appropriate class distribution for which, the learning method achieves a reasonable performance on the data. Our results show that combining an ensemble of Naive Bayes classiﬁers with threshold selection and under-sampling techniques works well for imbalanced data. K...

William Klement, Peter A. Flach, Nathalie Japkowic

Real-time Traffic

AI 2009 | Artificial Intelligence | Class Distribution | Class Ratios | Skewed Class Distributions |

claim paper

» Optimal Predictions in Everyday Cognition The Wisdom of Individuals or Crowds

» Scanning sequences after Gibbs sampling to find multiple occurrences of functional element...

» Active Sampling for Rank Learning via Optimizing the Area under the ROC Curve

» WorstCase Analysis of Selective Sampling for Linear Classification

» Bounds on the Sample Complexity for Private Learning and Private Data Release

» Smooth sensitivity and sampling in private data analysis

» Efficiently learning the accuracy of labeling sources for selective sampling

» Particle Swarm CMA Evolution Strategy for the optimization of multifunnel landscapes

Post Info
More Details (n/a)

Added	25 May 2010
Updated	25 May 2010
Type	Conference
Year	2009
Where	AI
Authors	William Klement, Peter A. Flach, Nathalie Japkowicz, Stan Matwin

Comments (0)

Sciweavers

Cost-Based Sampling of Individual Instances

AI 2009 | Artificial Intelligence | Class Distribution | Class Ratios | Skewed Class Distributions |

Explore & Download

Productivity Tools

Sciweavers