CICLING
2010
Springer

Natural Language Processing

Word Length n-Grams for Text Re-use Detection

15 years 10 months ago

Download users.dsic.upv.es

Abstract. The automatic detection of shared content in written documents –which includes text reuse and its unacknowledged commitment, plagiarism– has become an important probl...

Alberto Barrón-Cedeño, Chiara Basile...


ECIR
2009
Springer

Information Technology

On Automatic Plagiarism Detection Based on n-Grams Comparison

16 years 3 months ago

Download users.dsic.upv.es

Abstract. When automatic plagiarism detection is carried out considering a reference corpus, a suspicious text is compared to a set of original documents in order to relate the pla...

Alberto Barrón-Cedeño, Paolo Rosso


DRR
2009

Document Analysis

Text-image alignment for historical handwritten documents

15 years 3 months ago

Download vca.ele.tue.nl

We describe our work on text-image alignment in the context of building a historical document retrieval system. We aim at aligning images of words in handwritten lines with their text...

Svitlana Zinger, John Nerbonne, Lambert Schomaker


ICDAR
2007
IEEE

Document Analysis

An Efficient Word Segmentation Technique for Historical and Degraded Machine-Printed Documents

16 years 5 days ago

Download users.iit.demokritos.gr

Word segmentation is a crucial step for segmentation-free document analysis systems and is used for creating an index based on word matching. In this paper, we propose a novel met...

Michael Makridis, N. Nikolaou, Basilios Gatos


COLING
1996

Computational Linguistics

The Automatic Extraction of Open Compounds from Text Corpora

15 years 7 months ago

Download acl.ldc.upenn.edu

This paper describes a new method for extracting open compounds (uninterrupted sequences of words) from text corpora of languages such as Thai, Japanese and Korean that exhibit un...

Virach Sornlertlamvanich, Hozumi Tanaka
