Sciweavers

» Object-level document analysis of PDF files
DAS
2006
Springer
A System for Converting PDF Documents into Structured XML Format
We present in this paper a system for converting PDF legacy documents into structured XML format. This conversion system first extracts the different streams contained in PDF files...
Hervé Déjean, Jean-Luc Meunier
ICDAR
2003
IEEE
Extraction, layout analysis and classification of diagrams in PDF documents
Diagrams are a critical part of virtually all scientific and technical documents. Analyzing diagrams will be important for building comprehensive document retrieval systems. This ...
Robert P. Futrelle, Mingyan Shao, Chris Cieslik, A...
DAS
2008
Springer
Dolores: An Interactive and Class-Free Approach for Document Logical Restructuring
Physical and logical structure recovering from electronic documents is still an open issue. In this paper, we propose a flexible and efficient approach for recovering document str...
Jean-Luc Bloechle, Catherine Pugin, Rolf Ingold
DIAL
2004
IEEE
Xed: A New Tool for eXtracting Hidden Structures from Electronic Documents
PDF became a very common format for exchanging printable documents. Further, it can be easily generated from the major documents formats, which make a huge number of PDF documents...
Karim Hadjar, Maurizio Rigamonti, Denis Lalanne, R...
DGO
2010
Digital sustainable publication of legacy parliamentary proceedings
We address the problem of publishing parliamentary proceedings in a digital sustainable manner. We give an extensive requirements analysis, and based on that propose a uniform XML...
Maarten Marx, Nelleke Aders, Anne Schuth