Sciweavers

2926 search results - page 51 / 586
» Document Analysis
Sort
View
ICDAR
2003
IEEE
14 years 3 months ago
Numerical Sequence Extraction in Handwritten Incoming Mail Documents
In this communication, we propose a method for the automatic extraction of numerical fields in handwritten documents. The approach exploits the known syntactic structure of the nu...
Guillaume Koch, Laurent Heutte, Thierry Paquet
ICDAR
2003
IEEE
14 years 3 months ago
Automatic Discovery of Semantic Structures in HTML Documents
Template-driven HTML documents posses an implicit, fixed schema denoting concepts and their relationships in a hierarchical fashion. Discovering this schema remains a relatively ...
Saikat Mukherjee, Guizhen Yang, Wenfang Tan, I. V....
ICDAR
2003
IEEE
14 years 3 months ago
Correcting the Document Layout: A Machine Learning Approach
In this paper, a machine learning approach to support the user during the correction of the layout analysis is proposed. Layout analysis is the process of extracting a hierarchica...
Donato Malerba, Floriana Esposito, Oronzo Altamura...
CIKM
1997
Springer
14 years 2 months ago
The Need for Metrics in Visual Information Analysis
CT This paper explores several methods for visualizing the thematic content of large document collections. As opposed to traditional query-driven document retrieval, these methods ...
Nancy Miller, Elizabeth G. Hetzler, Grant Nakamura...
DAS
2006
Springer
13 years 12 months ago
XCDF: A Canonical and Structured Document Format
Accessing the structured content of PDF document is a difficult task, requiring pre-processing and reverse engineering techniques. In this paper, we first present different methods...
Jean-Luc Bloechle, Maurizio Rigamonti, Karim Hadja...