Sciweavers

27 search results for "On document splitting in passage detection" - page 3 / 6
CLEF
2010
Springer
13 years 8 months ago
A Textual-Based Similarity Approach for Efficient and Scalable External Plagiarism Analysis - Lab Report for PAN at CLEF 2010
In this paper we present an approach to detect external plagiarism based on textual similarity. This is an efficient and precise method that can be applied over large sets of docum...
Daniel Micol, Óscar Ferrández, Ferna...
PR
2002
13 years 7 months ago
Text extraction in complex color documents
Text extraction in mixed-type documents is a necessary pre-processing stage for many document applications. In mixed-type color documents, text, drawings and graphics appear w...
Charalambos Strouthopoulos, Nikos Papamarkos, Anto...
INEX
2007
Springer
14 years 1 month ago
Using and Detecting Links in Wikipedia
In this paper, we document our efforts at INEX 2007 where we participated in the Ad Hoc Track, the Link the Wiki Track, and the Interactive Track that continued from INEX 2006. Ou...
Khairun Nisa Fachry, Jaap Kamps, Marijn Koolen, Ju...
DAS
2010
Springer
13 years 5 months ago
Page frame detection for double page document images
Scanning two book pages at the same time helps to accelerate the scanning process but on the other hand introduces several difficulties if the user needs to have one page per imag...
Nikolaos Stamatopoulos, Basilios Gatos, Thodoris G...
ICDAR
2009
IEEE
14 years 2 months ago
Classifying Foreground Pixels in Document Images
We present a system that classifies pixels in a document image according to marking type such as machine print, handwriting, and noise. A segmenter module first splits an input ...
Prateek Sarkar, Eric Saund, Jing Lin