A Comparison of Event Models for Naive Bayes Anti-Spam E-Mail Filtering

14 years 1 months ago

Download ke.cse.nsysu.edu.tw

We describe experiments with a Naive Bayes text classiﬁer in the context of anti-spam E-mail ﬁltering, using two different statistical event models: a multi-variate Bernoulli model and a multinomial model. We introduce a family of feature ranking functions for feature selection in the multinomial event model that take account of the word frequency information. We present evaluation results on two publicly available corpora of legitimate and spam E-mails. We ﬁnd that the multinomial model is less biased towards one class and achieves slightly higher accuracy than the multi-variate Bernoulli model.

Karl-Michael Schneider

Real-time Traffic

EACL 2003 | Multi-variate Bernoulli Model | Multinomial Event Model | Multinomial Model | Natural Language Processing |

claim paper

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2003
Where	EACL
Authors	Karl-Michael Schneider

Comments (0)

Sciweavers

A Comparison of Event Models for Naive Bayes Anti-Spam E-Mail Filtering

EACL 2003 | Multi-variate Bernoulli Model | Multinomial Event Model | Multinomial Model | Natural Language Processing |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers