Sciweavers

IJDAR
2006

Table form document analysis based on the document structure grammar

14 years 14 days ago
Table form document analysis based on the document structure grammar
Structure analysis of table form documents is an important issue because a printed document and even an electronic document do not provide logical structural information but merely geometrical layout and lexical information. To handle these documents automatically, logical structure information is necessary. In this paper, we first analyze the elements of the form documents from a communication point of view and retrieve the grammatical elements that appear in them. Then, we present a document structure grammar which governs the logical structure of the form documents. Finally, we propose a structure analysis system of the table form documents based on the grammar. By using grammar notation, we can easily modify and keep it consistent, as the rules are relatively simple. Another advantage of using grammar notation is that it can be used for generating documents only from logical structure. In our system, documents are assumed to be composed of a set of boxes and they are classified as ...
Akira Amano, Naoki Asada, Masayuki Mukunoki, Masah
Added 12 Dec 2010
Updated 12 Dec 2010
Type Journal
Year 2006
Where IJDAR
Authors Akira Amano, Naoki Asada, Masayuki Mukunoki, Masahito Aoyama
Comments (0)