Sciweavers

37 search results - page 4 / 8
» Extending Page Segmentation Algorithms for Mixed-Layout Docu...
Sort
View
DIAL
2006
IEEE
243views Image Analysis» more  DIAL 2006»
14 years 1 months ago
AGORA: the Interactive Document Image Analysis Tool of the BVH Project
In this paper, we describe how meta-data of indexation can be extracted from historical document images using an interactive process with a software called AGORA. The algorithms i...
Jean-Yves Ramel, S. Busson, M. L. Demonet
DOCENG
2009
ACM
14 years 1 months ago
Object-level document analysis of PDF files
The PDF format is commonly used for the exchange of documents on the Web and there is a growing need to understand and extract or repurpose data held in PDF documents. Many system...
Tamir Hassan
DAS
2006
Springer
13 years 11 months ago
Aligning Transcripts to Automatically Segmented Handwritten Manuscripts
Abstract. Training and evaluation of techniques for handwriting recognition and retrieval is a challenge given that it is difficult to create large ground-truthed datasets. This is...
Jamie L. Rothfeder, R. Manmatha, Toni M. Rath
ICPR
2008
IEEE
14 years 1 months ago
Ancient document analysis based on text line extraction
In order to preserve our cultural heritage and for automated document processing libraries and national archives have started digitizing historical documents. In the case of degra...
Florian Kleber, Robert Sablatnig, Melanie Gau, Hei...
DOCENG
2003
ACM
14 years 18 days ago
Creating reusable well-structured PDF as a sequence of component object graphic (COG) elements
Portable Document Format (PDF) is a page-oriented, graphically rich format based on PostScript semantics and it is also the format interpreted by the Adobe Acrobat viewers. Althou...
Steven R. Bagley, David F. Brailsford, Matthew R. ...