Sciweavers

NLDB
2007
Springer

Developing Methods and Heuristics with Low Time Complexities for Filtering Spam Messages

14 years 6 months ago
Developing Methods and Heuristics with Low Time Complexities for Filtering Spam Messages
In this paper, we propose methods and heuristics having high accuracies and low time complexities for filtering spam e-mails. The methods are based on the n-gram approach and a heuristics which is referred to as the first n-words heuristics is devised. Though the main concern of the research is studying the applicability of these methods on Turkish e-mails, they were also applied to English e-mails. A data set for both languages was compiled. Extensive tests were performed with different parameters. Success rates of about 97% for Turkish e-mails and above 98% for English e-mails were obtained. In addition, it has been shown that the time complexities can be reduced significantly without sacrificing from success.
Tunga Güngör, Ali Çiltik
Added 08 Jun 2010
Updated 08 Jun 2010
Type Conference
Year 2007
Where NLDB
Authors Tunga Güngör, Ali Çiltik
Comments (0)