Spam filter evaluation with imprecise ground truth

16 years 1 months ago

Download plg.uwaterloo.ca

When trained and evaluated on accurately labeled datasets, online email spam ﬁlters are remarkably eﬀective, achieving error rates an order of magnitude better than classiﬁers in similar applications. But labels acquired from user feedback or third-party adjudication exhibit higher error rates than the best ﬁlters – even ﬁlters trained using the same source of labels. It is appropriate to use naturally occuring labels – including errors – as training data in evaluating spam ﬁlters. Erroneous labels are problematic, however, when used as ground truth to measure ﬁlter eﬀectiveness. Any measurement of the ﬁlter’s error rate will be augmented and perhaps masked by the label error rate. Using two natural sources of labels, we demonstrate automatic and semi-automatic methods that reduce the inﬂuence of labeling errors on evaluation, yielding substantially more precise measurements of true ﬁlter error rates. Categories and Subject Descriptors: H.3.3 [Information...

Gordon V. Cormack, Aleksander Kolcz

Real-time Traffic