Wikipedia Vandalism Detection Through Machine Learning: Feature Review and New Proposals - Lab Report for PAN at CLEF 2010

Wikipedia is an online encyclopedia that anyone can edit. In this open model, some people edit with the intent of harming the integrity of Wikipedia; this is known as vandalism. We extend the framework presented in (Potthast, Stein, and Gerling, 2008) for Wikipedia vandalism detection. In this approach, several vandalism-indicating features are extracted from edits in a vandalism corpus and fed to a supervised learning algorithm. The best-performing classifiers were LogitBoost and Random Forest. Our classifier, a Random Forest, obtained an AUC of 0.92236, ranking first in the PAN'10 Wikipedia vandalism detection task.
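
To make the pipeline in the abstract concrete, the following is a minimal sketch of its two ends: turning an edit into numeric features, and scoring a ranking of edits with AUC. The three features, the tiny `VULGAR_WORDS` list, and the function names are illustrative assumptions, not the feature set or code used in the paper. The learning step is left out; a supervised learner such as a Random Forest would be trained on feature vectors like these.

```python
import re

# Hypothetical word list for illustration; a real system would use a larger lexicon.
VULGAR_WORDS = {"stupid", "sucks", "idiot"}


def extract_features(old_text, new_text):
    """Return simple numeric features for one edit (old revision -> new revision)."""
    old_words = set(re.findall(r"[a-z']+", old_text.lower()))
    new_words = re.findall(r"[a-z']+", new_text.lower())
    # Words that appear in the new revision but not in the old one.
    inserted = [w for w in new_words if w not in old_words]
    letters = [c for c in new_text if c.isalpha()]
    upper = sum(1 for c in letters if c.isupper())
    return {
        # Values well below 1 indicate that the edit removed a lot of text.
        "size_ratio": (1 + len(new_text)) / (1 + len(old_text)),
        # Share of upper-case letters in the new revision.
        "upper_ratio": upper / len(letters) if letters else 0.0,
        # Number of inserted words found in the vulgarism list.
        "vulgar_inserted": sum(1 for w in inserted if w in VULGAR_WORDS),
    }


def auc(labels, scores):
    """Area under the ROC curve by pairwise comparison; ties count one half."""
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))
```

Here `labels` holds 1 for vandalism and 0 for regular edits, and `scores` holds the classifier's estimated vandalism scores. An AUC of 1.0 means every vandalism edit is ranked above every regular edit, while 0.5 corresponds to a random ranking.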

Santiago Moisés Mola-Velasco

CLEF 2010 | Information Technology | Vandalism Detection Task | Vandalism Indicating Features | Wikipedia Vandalism Detection

Post Info

Added	08 Nov 2010
Updated	08 Nov 2010
Type	Conference
Year	2010
Where	CLEF
Authors	Santiago Moisés Mola-Velasco
