Sciweavers

AIPRF
2008

Spam Sender Detection with Classification Modeling on Highly Imbalanced Mail Server Behavior Data

14 years 1 months ago
Spam Sender Detection with Classification Modeling on Highly Imbalanced Mail Server Behavior Data
Unsolicited commercial or bulk emails or emails containing viruses pose a great threat to the utility of email communications. A recent solution for filtering is reputation systems that can assign a value of trust to each IP address sending email messages. By analyzing the query patterns of each node utilizing reputation information, reputation systems can calculate a reputation score for each queried IP address. In this research, we explore a behavioral classification approach based on features extracted from such global messaging patterns. Due to the large amount of bad senders, this classification task has to cope with highly imbalanced data. Firstly, for each observed sender, we calculate periodicity properties using a discrete Fourier transform and global breadth information reflecting message volume and recipient distribution. After that, a Granular Support Vector Machine - Boundary Alignment algorithm (GSVM-BA) is implemented to solve the class imbalance problem and compared to ...
Yuchun Tang, Sven Krasser, Dmitri Alperovitch, Pau
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2008
Where AIPRF
Authors Yuchun Tang, Sven Krasser, Dmitri Alperovitch, Paul Judge
Comments (0)