Naive Bayes and logistic regression perform well in different regimes. While the former is a very simple generative model which is efficient to train and performs well empirically...
Most spam filters are configured for use at a very low falsepositive rate. Typically, the filters are trained with techniques that optimize accuracy or entropy, rather than perfor...
We describe a very simple technique for discriminatively training a spam filter. Our results on the TREC Enron spam corpus would have been the best for the Ham at .1% measure, and...
We show that a set of independently developed spam filters may be combined in simple ways to provide substantially better filtering than any of the individual filters. The resu...
Thomas R. Lynam, Gordon V. Cormack, David R. Cheri...