Document Layout Substructure Discovery

Abstract. We present DoLSuD (Document Layout Substructure Discovery), a system for the automatic discovery of relevant substructures in a document layout. DoLSuD extracts, analyzes, and describes the visual content of structured digital documents, such as catalogs, in order to discover repeating and distinctive substructures in the layout and to establish relations between textual and image content. Establishing meaningful links between images and text paragraphs from the catalog structure lets us exploit the semantic annotation of the text to annotate the images, integrating multimedia processing with Semantic Web technologies. The paper presents the system, experimental results, and the web-based service that uses the analysis results.
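
The abstract does not say how DoLSuD links images to text, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a page has already been segmented into rectangular blocks and pairs each image with the text block whose center is nearest. The `Block` type, its field names, and `link_images_to_text` are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from math import hypot


@dataclass(frozen=True)
class Block:
    """A rectangular layout region (hypothetical structure, not from the paper)."""
    name: str
    kind: str   # "image" or "text"
    x: float    # left edge
    y: float    # top edge
    w: float    # width
    h: float    # height

    @property
    def center(self):
        return (self.x + self.w / 2, self.y + self.h / 2)


def link_images_to_text(blocks):
    """Map each image block name to the name of the nearest text block.

    Nearest-center matching is an assumed stand-in heuristic; the abstract
    does not specify the criterion DoLSuD uses.
    """
    images = [b for b in blocks if b.kind == "image"]
    texts = [b for b in blocks if b.kind == "text"]
    links = {}
    if not texts:
        return links
    for img in images:
        ix, iy = img.center
        nearest = min(
            texts,
            key=lambda t: hypot(t.center[0] - ix, t.center[1] - iy),
        )
        links[img.name] = nearest.name
    return links


# A toy catalog page: two product images, each with a caption below it.
page = [
    Block("img1", "image", 0, 0, 100, 100),
    Block("txt1", "text", 0, 110, 100, 40),
    Block("img2", "image", 200, 0, 100, 100),
    Block("txt2", "text", 200, 110, 100, 40),
]
links = link_images_to_text(page)
print(links)
```

Once such links exist, any semantic annotation attached to a text block could be copied to its linked image, which is the kind of annotation transfer the abstract describes. A real catalog would likely need more than distance, for example the repeated layout pattern itself, to resolve ambiguous cases.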

Claudio Andreatta

Document Layout | Document Layout Substructure | Relevant Substructures | SAMT 2007 |

Post Info

Added	09 Jun 2010
Updated	09 Jun 2010
Type	Conference
Year	2007
Where	SAMT
Authors	Claudio Andreatta
