Sciweavers

CEAS
2006
Springer
14 years 3 months ago
Batch and Online Spam Filter Comparison
Gordon V. Cormack, Andrej Bratko
CEAS
2006
Springer
14 years 3 months ago
"Sorry, I Forgot the Attachment": Email Attachment Prediction
The missing attachment problem: a missing attachment generates a wave of emails from the recipients notifying the sender of the error. We present an attachment prediction system t...
Mark Dredze, John Blitzer, Fernando Pereira
CEAS
2006
Springer
14 years 3 months ago
Email Thread Reassembly Using Similarity Matching
Email thread reassembly is the task of linking messages by parentchild relationships. In this paper, we present two approaches to address this problem. One exploits previously und...
Jen-Yuan Yeh
CEAS
2006
Springer
14 years 3 months ago
Spam Filtering with Naive Bayes - Which Naive Bayes?
Naive Bayes is very popular in commercial and open-source anti-spam e-mail filters. There are, however, several forms of Naive Bayes, something the anti-spam literature does not a...
Vangelis Metsis, Ion Androutsopoulos, Georgios Pal...
CEAS
2006
Springer
14 years 3 months ago
Fast Uncertainty Sampling for Labeling Large E-mail Corpora
One of the biggest challenges in building effective anti-spam solutions is designing systems to defend against the everevolving bag of tricks spammers use to defeat them. Because ...
Richard Segal, Ted Markowitz, William Arnold
CEAS
2006
Springer
14 years 3 months ago
Dynamic Port 25 Blocking to Control SPAM Zombies
This paper presents the results of a case study in which outbound SPAM, here referring to excessive amounts of bulk-generated email, is suppressed using dynamic Port 25 blocking. ...
Jonathan Schmidt
CEAS
2006
Springer
14 years 3 months ago
Online Discriminative Spam Filter Training
We describe a very simple technique for discriminatively training a spam filter. Our results on the TREC Enron spam corpus would have been the best for the Ham at .1% measure, and...
Joshua Goodman, Wen-tau Yih
CEAS
2006
Springer
14 years 3 months ago
Annotating Subsets of the Enron Email Corpus
We present an annotation project for two subsets of the Enron email corpus. The first is a subset of the UC Berkeley Enron Email Analysis Project and the second consists of a port...
Jade Goldstein, Andres Kwasinksi, Paul Kingsbury, ...