Sciweavers

ICDIM
2006
IEEE

A Framework for the Encoding of Multilayered Documents

14 years 5 months ago
A Framework for the Encoding of Multilayered Documents
Electronic publishing of material digitized using imaging and OCR calls for a special delivery format capable of reconstructing original documents in a well-usable electronic form. We present a framework for the universal encoding of multilingual image-ontext documents, enabling retrieval systems to textsearch and highlight hits on original page images. A generalized format for representation of image-on-text allows for integration of different OCR engines and target format encoders. This framework’s current implementation encodes multilingual content into DjVu and PDF. Performance has been evaluated with focus on file size and shown that overhead of adding text layers is small compared to advantages and that output is comparable to other systems.
Youssef Eldakar, Noha Adly, Magdy Nagi
Added 11 Jun 2010
Updated 11 Jun 2010
Type Conference
Year 2006
Where ICDIM
Authors Youssef Eldakar, Noha Adly, Magdy Nagi
Comments (0)