Sciweavers

AIRWEB
2006
Springer

Improving Cloaking Detection using Search Query Popularity and Monetizability

14 years 4 months ago
Improving Cloaking Detection using Search Query Popularity and Monetizability
Cloaking is a search engine spamming technique used by some Web sites to deliver one page to a search engine for indexing while serving an entirely different page to users browsing the site. In this paper, we show that the degree of cloaking among search results depends on query properties such as popularity and monetizability. We propose estimating query popularity and monetizability by analyzing search engine query logs and online advertising click-through logs, respectively. We also present a new measure for detecting cloaked URLs that uses a normalized term frequency ratio between multiple downloaded copies of Web pages. Experiments are conducted using 10,000 search queries and 3 million associated search result URLs. Experimental results indicate that while only 73.1% of the cloaked popular search URLs are spam, over 98.5% of the cloaked monetizable search URLs are spam. Further, on average, the search results for top 2% most cloaked queries are 10x more likely to be cloaking tha...
Kumar Chellapilla, David Maxwell Chickering
Added 20 Aug 2010
Updated 20 Aug 2010
Type Conference
Year 2006
Where AIRWEB
Authors Kumar Chellapilla, David Maxwell Chickering
Comments (0)