Identifying web spam with user behavior analysis


Download airweb.cse.lehigh.edu

Combating Web spam has become one of the top challenges for Web search engines. State-of-the-art spam detection techniques are usually designed for specific, known types of Web spam and are ineffective or inefficient against newly appearing spam. Based on an analysis of user behavior in Web access logs, we propose a spam page detection algorithm based on Bayesian learning. The main contributions of our work are: (1) User visiting patterns of spam pages are studied, and three user behavior features are proposed to separate Web spam pages from ordinary ones. (2) A novel spam detection framework is proposed that can detect unknown and newly appearing spam types with the help of user behavior analysis. Preliminary experiments on large-scale Web access log data (containing over 2.74 billion user clicks) show the effectiveness of the proposed features and detection framework.

Categories and Subject Descriptors: H.3.3 [Information Search and Retrieval]: Search process; H.3.4 [Systems and Software]: Performance...
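The abstract does not name the three user behavior features or spell out the Bayesian learning procedure, so the following is only a minimal sketch of the general idea: a naive Bayes scorer over three per-page behavior rates. The class name, the binning scheme, the Laplace smoothing, and the toy values are all assumptions made for illustration. They are not taken from the paper and are not the authors' algorithm or data.

```python
import math


def bin_value(x, n_bins=5):
    """Map a rate in [0, 1] to a bin index in 0..n_bins-1."""
    return min(int(x * n_bins), n_bins - 1)


class NaiveBayesSpamScorer:
    """Generic naive Bayes over binned features (illustrative, not the paper's model)."""

    def __init__(self, n_features, n_bins=5):
        self.n_features = n_features
        self.n_bins = n_bins
        # counts[label][feature_index][bin] = number of training pages seen
        self.counts = {
            label: [[0] * n_bins for _ in range(n_features)] for label in (0, 1)
        }
        self.totals = {0: 0, 1: 0}

    def fit(self, rows, labels):
        """rows: feature tuples with values in [0, 1]; labels: 1 = spam, 0 = ordinary."""
        for row, label in zip(rows, labels):
            self.totals[label] += 1
            for i, x in enumerate(row):
                self.counts[label][i][bin_value(x, self.n_bins)] += 1

    def _log_joint(self, row, label):
        n = self.totals[label]
        total = self.totals[0] + self.totals[1]
        # Laplace-smoothed class prior
        logp = math.log((n + 1) / (total + 2))
        for i, x in enumerate(row):
            c = self.counts[label][i][bin_value(x, self.n_bins)]
            # Laplace-smoothed likelihood of this bin given the class
            logp += math.log((c + 1) / (n + self.n_bins))
        return logp

    def spam_probability(self, row):
        """Posterior P(spam | features) under the naive independence assumption."""
        log_spam = self._log_joint(row, 1)
        log_ham = self._log_joint(row, 0)
        m = max(log_spam, log_ham)  # subtract the max for numerical stability
        e_spam = math.exp(log_spam - m)
        e_ham = math.exp(log_ham - m)
        return e_spam / (e_spam + e_ham)


# Synthetic toy data: three hypothetical behavior rates per page (made-up values).
toy_rows = [
    (0.9, 0.8, 0.9), (0.8, 0.9, 0.7), (0.95, 0.7, 0.85),  # labeled spam
    (0.1, 0.2, 0.1), (0.2, 0.1, 0.3), (0.3, 0.2, 0.2),    # labeled ordinary
]
toy_labels = [1, 1, 1, 0, 0, 0]

scorer = NaiveBayesSpamScorer(n_features=3)
scorer.fit(toy_rows, toy_labels)
print(scorer.spam_probability((0.85, 0.9, 0.8)))
print(scorer.spam_probability((0.15, 0.1, 0.2)))
```

Because the scorer relies only on how users reach and leave a page rather than on page content or link structure, a sketch like this would not need to know the spamming technique in advance, which is the property the abstract emphasizes.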

Yiqun Liu, Rongwei Cen, Min Zhang, Shaoping Ma, Liyun Ru


AIRWEB 2008 | Internet Technology | Spam Detection | User Behavior | Web Spam


Post Info

Added	12 Oct 2010
Updated	12 Oct 2010
Type	Conference
Year	2008
Where	AIRWEB
Authors	Yiqun Liu, Rongwei Cen, Min Zhang, Shaoping Ma, Liyun Ru


Sciweavers
