A Cluster-Based Plagiarism Detection Method - Lab Report for PAN at CLEF 2010

15 years 7 months ago

Download www.webis.de

In this paper we describe a cluster-based plagiarism detection method, which we have used in the learning management system of SCUT to detect plagiarism in the network engineering related courses. And we also used this method to detect external plagiarism in the PAN-10 competition. The method is divided into three steps: the first step, called pre-selecting, is to narrow the scope of detection using the successive same fingerprint; the second step, called locating, is to find and merge all fragments between two documents using cluster method; the third step, called post-processing, is to deal with some merging errors. Our method ran 19 hours in the PAN-10 competition, and the result ranked the second place, which met our expectation. Keywords. Plagiarism detection, Similar text, Locating, Cluster

Du Zou, Wei-jiang Long, Zhang Ling

Real-time Traffic

CLEF 2010 | Information Technology | PAN-10 Competition | Plagiarism | Plagiarism Detection Method |

claim paper

Post Info
More Details (n/a)

Added	08 Nov 2010
Updated	18 Dec 2011
Type	Conference
Year	2010
Where	CLEF
Authors	Du Zou, Wei-jiang Long, Zhang Ling

Comments (0)

Sciweavers

A Cluster-Based Plagiarism Detection Method - Lab Report for PAN at CLEF 2010

CLEF 2010 | Information Technology | PAN-10 Competition | Plagiarism | Plagiarism Detection Method |

Explore & Download

Productivity Tools

Sciweavers