Sciweavers

466 search results - page 14 / 94
» Scalable Feature Extraction from Noisy Documents
Sort
View
DIAL
2006
IEEE
243views Image Analysis» more  DIAL 2006»
14 years 3 months ago
AGORA: the Interactive Document Image Analysis Tool of the BVH Project
In this paper, we describe how meta-data of indexation can be extracted from historical document images using an interactive process with a software called AGORA. The algorithms i...
Jean-Yves Ramel, S. Busson, M. L. Demonet
CLEF
2010
Springer
13 years 11 months ago
A Textual-Based Similarity Approach for Efficient and Scalable External Plagiarism Analysis - Lab Report for PAN at CLEF 2010
In this paper we present an approach to detect external plagiarism based on textual similarity. This is an efficient and precise method that can be applied over large sets of docum...
Daniel Micol, Óscar Ferrández, Ferna...
DKE
2007
119views more  DKE 2007»
13 years 9 months ago
XML subtree reconstruction from relational storage of XML documents
Numerous researchers have proposed to use relational databases to store and query XML documents. In these systems, the elements selected by an XML query are returned to an applica...
Artem Chebotko, Mustafa Atay, Shiyong Lu, Farshad ...
ICDAR
2011
IEEE
12 years 9 months ago
A Handwritten Character Extraction Algorithm for Multi-language Document Image
—In this paper, we propose a novel method for extracting handwritten characters from multi-language document images, which may contain various types of characters, e.g. Chinese, ...
Yonghong Song, Guilin Xiao, Yuanlin Zhang, Lei Yan...
PRIS
2003
13 years 11 months ago
Numerical Field Extraction in Handwritten Incoming Mail Documents
In this communication, we propose a method for the automatic extraction of numerical fields in handwritten documents. The approach exploits the known syntactic structure of the num...
Guillaume Koch, Laurent Heutte, Thierry Paquet