Automatic Vandalism Detection in Wikipedia

Abstract. We present results of a new approach to detecting destructive article revisions, so-called vandalism, in Wikipedia. Vandalism detection is a one-class classification problem in which vandalism edits are the target class to be identified among all revisions. Interestingly, vandalism detection has not yet been addressed in the Information Retrieval literature. In this paper we discuss the characteristics of vandalism as humans recognize it and develop features that render vandalism detection a machine learning task. We compiled a large number of vandalism edits into a corpus, which allows existing and new detection approaches to be compared. Using logistic regression, our model achieves 83% precision at 77% recall. Compared to the rule-based methods currently applied in Wikipedia, our approach increases F-measure performance by 49% while also being faster.
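
The abstract reports precision and recall separately but states the comparison with rule-based methods in terms of the F-measure. As a rough illustration of how the two reported figures combine, the sketch below computes the balanced F-measure (F1, the harmonic mean of precision and recall). The abstract does not say which F-measure variant the authors used, so the choice of `beta = 1` and the helper name `f_measure` are assumptions made here for illustration only.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """Weighted harmonic mean of precision and recall (F-beta).

    beta = 1 gives the balanced F1 score. This is an assumption:
    the abstract does not state which variant was used.
    """
    if precision <= 0.0 or recall <= 0.0:
        return 0.0
    b2 = beta * beta
    return (1.0 + b2) * precision * recall / (b2 * precision + recall)


# Figures taken from the abstract: 83% precision at 77% recall.
reported_f1 = f_measure(0.83, 0.77)
print(f"F1 = {reported_f1:.3f}")  # prints "F1 = 0.799"
```

Under that assumption the reported operating point corresponds to an F1 of roughly 0.80. The baseline values behind the 49% improvement are not given in the abstract, so they are not reconstructed here.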

Martin Potthast, Benno Stein, Robert Gerling

ECIR 2008 | Information Technology | So-called Vandalism | Vandalism Detection | Vandalism Edits

Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	ECIR
Authors	Martin Potthast, Benno Stein, Robert Gerling
