Naive Bayes Spam Filtering Using Word-Position-Based Attributes


This paper explores the use of the naive Bayes classifier as the basis for personalised spam filters. Several machine learning algorithms, including variants of naive Bayes, have previously been used for this purpose, but the author's implementation using word-position-based attribute vectors gave very good results when tested on several publicly available corpora. The effects of various forms of attribute selection (removal of frequent words, removal of infrequent words, and selection by mutual information) are investigated. It is also shown how n-grams, with n > 1, may be used to boost classification performance. Finally, an efficient weighting scheme for cost-sensitive classification is introduced.
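
As a rough illustration of the ideas named in the abstract, the following minimal sketch shows a naive Bayes filter in which every word position in a message is an attribute whose value is the word (or n-gram) found there. It is not the author's implementation. It assumes that word probabilities do not depend on the position itself, so training reduces to counting tokens per class. The class name `PositionNaiveBayes`, the whitespace tokenisation, the Laplace smoothing constant `alpha`, and the `ham_log_weight` bias used to make false positives less likely are all hypothetical choices made for this example.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of the token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


class PositionNaiveBayes:
    """Illustrative naive Bayes filter (hypothetical, not the paper's code)."""

    def __init__(self, n=1, alpha=1.0):
        self.n = n                  # n-gram length; n > 1 uses word sequences
        self.alpha = alpha          # Laplace smoothing constant (assumption)
        self.counts = {"spam": Counter(), "ham": Counter()}
        self.docs = Counter()       # number of training messages per class
        self.vocab = set()          # all n-grams seen during training

    def _features(self, text):
        # One attribute per position; its value is the n-gram starting there.
        return ngrams(text.lower().split(), self.n)

    def train(self, text, label):
        feats = self._features(text)
        self.counts[label].update(feats)
        self.vocab.update(feats)
        self.docs[label] += 1

    def log_score(self, text, label):
        total = sum(self.counts[label].values())
        v = len(self.vocab)
        # Log prior plus the sum of smoothed log likelihoods over positions.
        score = math.log(self.docs[label] / sum(self.docs.values()))
        for f in self._features(text):
            p = (self.counts[label][f] + self.alpha) / (total + self.alpha * v)
            score += math.log(p)
        return score

    def classify(self, text, ham_log_weight=0.0):
        # ham_log_weight > 0 biases decisions towards "ham", a simple
        # stand-in for cost-sensitive classification (assumption).
        spam = self.log_score(text, "spam")
        ham = self.log_score(text, "ham") + ham_log_weight
        return "spam" if spam > ham else "ham"
```

A filter built this way would be trained with repeated calls to `train(text, label)` using the labels `"spam"` and `"ham"`, and queried with `classify(text)`. Raising `ham_log_weight` makes the filter more reluctant to label a message as spam, which mirrors the usual motivation for cost-sensitive classification: losing a legitimate message is costlier than letting a spam message through.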

Johan Hovold

CEAS 2005 | Naive Bayes | Naive Bayes Classifier | Word-position-based Attribute Vectors

Added	26 Jun 2010
Updated	26 Jun 2010
Type	Conference
Year	2005
Where	CEAS
Authors	Johan Hovold
