Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

150

Voted

ECIR
2009
Springer

155views Information Technology» more ECIR 2009»

On Automatic Plagiarism Detection Based on n-Grams Comparison

16 years 3 months ago

On Automatic Plagiarism Detection Based on n-Grams Comparison

Download users.dsic.upv.es

Abstract. When automatic plagiarism detection is carried out considering a reference corpus, a suspicious text is compared to a set of original documents in order to relate the plagiarised text fragments to their potential source. One of the biggest diﬃculties in this task is to locate plagiarised fragments that have been modiﬁed (by rewording, insertion or deletion, for example) from the source text. The deﬁnition of proper text chunks as comparison units of the suspicious and original texts is crucial for the success of this kind of applications. Our experiments with the METER corpus show that the best results are obtained when considering low level word n-grams comparisons (n = {2, 3}).

Alberto Barrón-Cedeño, Paolo Rosso

Real-time Traffic

Computer Science | ECIR 2009 | Proper Text Chunks | Suspicious Text | Text Fragments |

claim paper

Related Content

» Word Length nGrams for Text Reuse Detection

» Intrinsic Plagiarism Detection

» Comparative evaluation of text and citationbased plagiarism detection approaches using gut...

» Citation based plagiarism detection a new approach to identify plagiarized work language i...

» Towards Document Plagiarism Detection Based on the Relevance and Fragmentation of the Reus...

» Reducing the Plagiarism Detection Search Space on the Basis of the KullbackLeibler Distanc...

» On Crosslingual Plagiarism Analysis using a Statistical Model

» Detection of Plagiarism in Student Essays

» Detection of Plagiarism in University Projects Using Metricsbased Spectral Similarity

Post Info
More Details (n/a)

Added	08 Mar 2010
Updated	08 Mar 2010
Type	Conference
Year	2009
Where	ECIR
Authors	Alberto Barrón-Cedeño, Paolo Rosso

Comments (0)