Sciweavers

AIRWEB
2008
Springer

Identifying web spam with user behavior analysis

14 years 1 months ago
Identifying web spam with user behavior analysis
Combating Web spam has become one of the top challenges for Web search engines. State-of-the-art spam detection techniques are usually designed for specific known types of Web spam and are incapable and inefficient for newly-appeared spam. With user behavior analyses into Web access logs, we propose a spam page detection algorithm based on Bayesian Learning. The main contributions of our work are: (1) User visiting patterns of spam pages are studied and three user behavior features are proposed to separate Web spam from ordinary ones. (2) A novel spam detection framework is proposed that can detect unknown spam types and newly-appeared spam with the help of user behavior analysis. Preliminary experiments on large scale Web access log data (containing over 2.74 billion user clicks) show the effectiveness of the proposed features and detection framework. Categories and Subject Descriptors H.3.3 [Information Search and Retrieval]: Search process, H.3.4 [Systems and Software]: Performance...
Yiqun Liu, Rongwei Cen, Min Zhang, Shaoping Ma, Li
Added 12 Oct 2010
Updated 12 Oct 2010
Type Conference
Year 2008
Where AIRWEB
Authors Yiqun Liu, Rongwei Cen, Min Zhang, Shaoping Ma, Liyun Ru
Comments (0)