Sciweavers

ICTAI
2007
IEEE

Automatic Personalized Spam Filtering through Significant Word Modeling

14 years 6 months ago
Automatic Personalized Spam Filtering through Significant Word Modeling
Typically, spam filters are built on the assumption that the characteristics of e-mails in the training set is identical to those in individual users’ inboxes on which it will be applied. This assumption is oftentimes incorrect leading to poor performance of the filter. A personalized spam filter is built by taking into account the characteristics of e-mails in individual users’ inboxes. We present an automatic approach for personalized spam filtering that does not require users’ feedback. The proposed algorithm builds a statistical model of significant spam and non-spam words from the labeled training set and then updates it in multiple passes over the unlabeled individual user’s inbox. The personalization of the model leads to improved filtering performance. We evaluate our algorithm on two publicly available datasets. The results show that our algorithm is robust and scalable, and a viable solution to the server-side personalized spam filtering problem. Moreover, it outperf...
Khurum Nazir Junejo, Asim Karim
Added 03 Jun 2010
Updated 03 Jun 2010
Type Conference
Year 2007
Where ICTAI
Authors Khurum Nazir Junejo, Asim Karim
Comments (0)