We present a novel system for automatically marking up text documents into XML and discuss the benefits of XML markup for intelligent information retrieval. The system uses the Se...
Abstract: With new techniques, such as Microformats or RDFa, for integrating semantics into existing web formats, we expect a strong increase in the number of semantically annotated documents in...
The method proposed herein detects text lines on handwritten pages, which may include lines oriented in several directions, erasures, or annotations between main lines. The m...
Because of the complexity of documents and the variety of applications which must be supported, document understanding requires the integration of image understanding with text un...
Suzanne Liebowitz Taylor, Deborah A. Dahl, Mark Li...
This paper presents a text/graphic labelling method for ancient printed documents. Our approach is based on the extraction and quantification of the various orientations that are pre...