Sciweavers

CLEF
2010
Springer

A Cluster-Based Plagiarism Detection Method - Lab Report for PAN at CLEF 2010

14 years 1 months ago
A Cluster-Based Plagiarism Detection Method - Lab Report for PAN at CLEF 2010
In this paper we describe a cluster-based plagiarism detection method, which we have used in the learning management system of SCUT to detect plagiarism in the network engineering related courses. And we also used this method to detect external plagiarism in the PAN-10 competition. The method is divided into three steps: the first step, called pre-selecting, is to narrow the scope of detection using the successive same fingerprint; the second step, called locating, is to find and merge all fragments between two documents using cluster method; the third step, called post-processing, is to deal with some merging errors. Our method ran 19 hours in the PAN-10 competition, and the result ranked the second place, which met our expectation. Keywords. Plagiarism detection, Similar text, Locating, Cluster
Du Zou, Wei-jiang Long, Zhang Ling
Added 08 Nov 2010
Updated 18 Dec 2011
Type Conference
Year 2010
Where CLEF
Authors Du Zou, Wei-jiang Long, Zhang Ling
Comments (0)