Spam Sender Detection with Classification Modeling on Highly Imbalanced Mail Server Behavior Data

15 years 3 months ago

Download www.trustedsource.org

Unsolicited commercial or bulk emails or emails containing viruses pose a great threat to the utility of email communications. A recent solution for filtering is reputation systems that can assign a value of trust to each IP address sending email messages. By analyzing the query patterns of each node utilizing reputation information, reputation systems can calculate a reputation score for each queried IP address. In this research, we explore a behavioral classification approach based on features extracted from such global messaging patterns. Due to the large amount of bad senders, this classification task has to cope with highly imbalanced data. Firstly, for each observed sender, we calculate periodicity properties using a discrete Fourier transform and global breadth information reflecting message volume and recipient distribution. After that, a Granular Support Vector Machine - Boundary Alignment algorithm (GSVM-BA) is implemented to solve the class imbalance problem and compared to ...

Yuchun Tang, Sven Krasser, Dmitri Alperovitch, Pau

Real-time Traffic

AIPRF 2008 | Artificial Intelligence | IP Address | Reputation Systems | Support Vector Machine |

claim paper

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	AIPRF
Authors	Yuchun Tang, Sven Krasser, Dmitri Alperovitch, Paul Judge

Comments (0)

Sciweavers

Spam Sender Detection with Classification Modeling on Highly Imbalanced Mail Server Behavior Data

AIPRF 2008 | Artificial Intelligence | IP Address | Reputation Systems | Support Vector Machine |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers