
DOCENG
2005
ACM


Structuring documents according to their table of contents



In this paper, we present a method for structuring a document according to the information present in its Table of Contents. The detection of the ToC as well as the determination of the parts it refers to in the document body rely on a series of generic properties characterizing any ToC, while its hierarchization is achieved using clustering techniques. We also report on the robustness and performance of the method before discussing it, in light of related work.

Categories and Subject Descriptors: I.7.2 [Computing Methodologies]: Document and Text Processing - Document preparation - Markup languages; I.7.4 [Computing Methodologies]: Document and Text Processing - Electronic Publishing; I.7.5 [Computing Methodologies]: Document Capture - Document analysis

General Terms: Algorithms, Documentation, Experimentation

Keywords: Document Structuring, Table of Contents recognition

Hervé Déjean, Jean-Luc Meunier


Related Content

» Automated Detection and Segmentation of Table of Contents Page from Document Images

» VERT: A Semantic Approach for Content Search and Content Extraction in XML Query Processing

» Feature- and Query-Based Table of Contents Generation for XML Documents

» Analysis of Book Documents Table of Content Based on Clustering

» Automatic extraction of table metadata from digital documents

» A Table Detection Method for Multipage PDF Documents via Visual Seperators and Tabular Str...

» On the Reading of Tables of Contents

» Identifying table boundaries in digital documents via sparse line detection

» A Constraint-based Approach to Table Structure Derivation

Post Info

Added	14 Oct 2010
Updated	14 Oct 2010
Type	Conference
Year	2005
Where	DOCENG
Authors	Hervé Déjean, Jean-Luc Meunier
