Sciweavers

COLING
2010

Detection of Simple Plagiarism in Computer Science Papers

13 years 7 months ago
Detection of Simple Plagiarism in Computer Science Papers
Plagiarism is the use of the language and thoughts of another work and the representation of them as one's own original work. Various levels of plagiarism exist in many domains in general and in academic papers in particular. Therefore, diverse efforts are taken to automatically identify plagiarism. In this research, we developed software capable of simple plagiarism detection. We have built a corpus (C) containing 10,100 academic papers in computer science written in English and two test sets including papers that were randomly chosen from C. A widespread variety of baseline methods has been developed to identify identical or similar papers. Several methods are novel. The experimental results and their analysis show interesting findings. Some of the novel methods are among the best predictive methods.
Yaakov HaCohen-Kerner, Aharon Tayeb, Natan Ben-Dro
Added 13 May 2011
Updated 13 May 2011
Type Journal
Year 2010
Where COLING
Authors Yaakov HaCohen-Kerner, Aharon Tayeb, Natan Ben-Dror
Comments (0)