Nearest-Neighbor-Based Active Learning for Rare Category Detection

15 years 9 months ago

Download books.nips.cc

Rare category detection is an open challenge for active learning, especially in the de-novo case (no labeled examples), but of signiﬁcant practical importance for data mining - e.g. detecting new ﬁnancial transaction fraud patterns, where normal legitimate transactions dominate. This paper develops a new method for detecting an instance of each minority class via an unsupervised local-density-differential sampling strategy. Essentially a variable-scale nearest neighbor process is used to optimize the probability of sampling tightly-grouped minority classes, subject to a local smoothness assumption of the majority class. Results on both synthetic and real data sets are very positive, detecting each minority class with only a fraction of the actively sampled points required by random sampling and by Pelleg’s Interleave method, the prior best technique in the sparse literature on this topic.

Jingrui He, Jaime G. Carbonell

Real-time Traffic

Information Technology | Minority Class | NIPS 2007 | Normal Legitimate Transactions | Tightly-grouped Minority Classes |

claim paper

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2007
Where	NIPS
Authors	Jingrui He, Jaime G. Carbonell

Comments (0)

Sciweavers

Nearest-Neighbor-Based Active Learning for Rare Category Detection

Information Technology | Minority Class | NIPS 2007 | Normal Legitimate Transactions | Tightly-grouped Minority Classes |

Explore & Download

Productivity Tools

Sciweavers