Thwarting the Nigritude Ultramarine: Learning to Identify Link Spam

14 years 6 months ago

Download www.cs.uni-potsdam.de

The page rank of a commercial web site has an enormous economic impact because it directly inﬂuences the number of potential customers that ﬁnd the site as a highly ranked search engine result. Link spamming – inﬂating the page rank of a target page by artiﬁcially creating many referring pages – has therefore become a common practice. In order to maintain the quality of their search results, search engine providers try to oppose eﬀorts that decorrelate page rank and relevance and maintain blacklists of spamming pages while spammers, at the same time, try to camouﬂage their spam pages. We formulate the problem of identifying link spam and discuss a methodology for generating training data. Experiments reveal the eﬀectiveness of classes of intrinsic and relational attributes and shed light on the robustness of classiﬁers against obfuscation of attributes by an adversarial spammer. We identify open research problems related to web spam.

Isabel Drost, Tobias Scheffer

Real-time Traffic

ECML 2005 | Enormous Economic Impact | Page Rank | Search Engine |

claim paper

Post Info
More Details (n/a)

Added	27 Jun 2010
Updated	27 Jun 2010
Type	Conference
Year	2005
Where	ECML
Authors	Isabel Drost, Tobias Scheffer

Comments (0)

Sciweavers

Thwarting the Nigritude Ultramarine: Learning to Identify Link Spam

ECML 2005 | Enormous Economic Impact | Page Rank | Search Engine |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers