Wiki Vandalysis - Wikipedia Vandalism Analysis - Lab Report for PAN at CLEF 2010

14 years 3 months ago

Download www.uni-weimar.de

Wikipedia describes itself as the "free encyclopedia that anyone can edit". Along with the helpful volunteers who contribute by improving the articles, a great number of malicious users abuse the open nature of Wikipedia by vandalizing articles. Deterring and reverting vandalism has become one of the major challenges of Wikipedia as its size grows. Wikipedia editors fight vandalism both manually and with automated bots that use regular expressions and other simple rules to recognize malicious edits[5]. Researchers have also proposed Machine Learning algorithms for vandalism detection[19,15], but these algorithms are still in their infancy and have much room for improvement. This paper presents an approach to fighting vandalism by extracting various features from the edits for machine learning classification. Our classifier uses information about the editor, the sentiment of the edit, the "quality" of the edit (i.e. spelling errors), and targeted regular expressions ...

Manoj Harpalani, Thanadit Phumprao, Megha Bassi, M

Real-time Traffic

CLEF 2010 | Editors Fight Vandalism | Information Technology | Regular Expressions | Wikipedia |

claim paper

Post Info
More Details (n/a)

Added	08 Nov 2010
Updated	08 Nov 2010
Type	Conference
Year	2010
Where	CLEF
Authors	Manoj Harpalani, Thanadit Phumprao, Megha Bassi, Michael Hart, Rob Johnson

Comments (0)

Sciweavers

Wiki Vandalysis - Wikipedia Vandalism Analysis - Lab Report for PAN at CLEF 2010

CLEF 2010 | Editors Fight Vandalism | Information Technology | Regular Expressions | Wikipedia |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers