Sciweavers

Search results for "Email data cleaning" (775 results, page 18 of 155)
ML
2006
ACM
13 years 9 months ago
A suffix tree approach to anti-spam email filtering
We present an approach to email filtering based on the suffix tree data structure. A method for the scoring of emails using the suffix tree is developed and a number of scoring and...
Rajesh Pampapathi, Boris Mirkin, Mark Levene
CEAS
2006
Springer
14 years 1 month ago
Using Early Results from the 'spamHINTS' Project to Estimate an ISP Abuse Team's Task
ISPs operate "abuse" teams to deal with reports of inappropriate email being sent by their customers. Currently, the majority of this work is dealing with insecure syste...
Richard Clayton
ADMA
2008
Springer
14 years 4 months ago
Using Data Mining Methods to Predict Personally Identifiable Information in Emails
Private information management and compliance are now important issues for most organizations. As a major communication tool for organizations, email is one of the many pot...
Liqiang Geng, Larry Korba, Xin Wang, Yunli Wang, H...
CHI
2003
ACM
14 years 10 months ago
Marked for deletion: an analysis of email data
What characteristics of an email message make it more likely to be discarded? Statistical analyses of a set of deleted and non-deleted messages revealed several factors that were ...
Laura Dabbish, Gina Danielle Venolia, Jonathan J. ...
PVLDB
2010
13 years 8 months ago
Explore or Exploit? Effective Strategies for Disambiguating Large Databases
Data ambiguity is inherent in applications such as data integration, location-based services, and sensor monitoring. In many situations, it is possible to “clean”, or remove, ...
Reynold Cheng, Eric Lo, Xuan Yang, Ming-Hay Luk, X...