

Cantina: a content-based approach to detecting phishing web sites

Phishing is a significant problem involving fraudulent email and web sites that trick unsuspecting users into revealing private information. In this paper, we present the design, implementation, and evaluation of CANTINA, a novel, content-based approach to detecting phishing web sites, based on the TF-IDF information retrieval algorithm. We also discuss the design and evaluation of several heuristics we developed to reduce false positives. Our experiments show that CANTINA is good at detecting phishing sites, correctly labeling approximately 95% of phishing sites.

Categories and Subject Descriptors: C.2.0 [Computer-Communication Networks]: General - Security and Protection; H.3.3 [Information Search and Retrieval]: Retrieval Models
General Terms: Algorithms, Measurement, Security, Human Factors
Keywords: Phishing, Anti-Phishing, TF-IDF, Toolbar, Evaluation
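The abstract names TF-IDF as the basis of the approach but does not describe how CANTINA applies it. As a rough illustration of the general technique only, the sketch below scores the terms of one page against a small reference corpus and returns the highest-weighted terms. The function names, the smoothed IDF variant, and the toy data are all assumptions made for this example and are not taken from the paper.

```python
import math
from collections import Counter


def tf_idf(doc_tokens, corpus):
    """Return a {term: score} map for doc_tokens.

    corpus is a list of token lists used only to estimate document
    frequency. The smoothed IDF used here is one common variant,
    chosen for illustration (an assumption, not the paper's formula).
    """
    counts = Counter(doc_tokens)
    total = len(doc_tokens)
    n_docs = len(corpus)
    scores = {}
    for term, count in counts.items():
        tf = count / total                              # term frequency in the page
        df = sum(1 for doc in corpus if term in doc)    # documents containing the term
        idf = math.log((1 + n_docs) / (1 + df)) + 1     # smoothed inverse document frequency
        scores[term] = tf * idf
    return scores


def top_terms(doc_tokens, corpus, k=5):
    """Return the k highest-scoring terms, ties broken alphabetically."""
    scores = tf_idf(doc_tokens, corpus)
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


# Hypothetical toy data: one page and a three-document reference corpus.
page = "paypal login account verify paypal password".split()
corpus = [
    ["the", "account", "login"],
    ["news", "weather", "login"],
    ["account", "settings", "the"],
]
print(top_terms(page, corpus, k=3))
```

Terms that are frequent on the page but rare in the corpus rank highest, which is the property that makes TF-IDF useful for picking out the words that characterize a page.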
Jason I. Hong, Lorrie Faith Cranor, Yue Zhang 0002
Added 22 Nov 2009
Updated 22 Nov 2009
Type Conference
Year 2007
Where WWW