A Framework for the Encoding of Multilayered Documents

14 years 6 months ago

Download www.bibalex.org

Electronic publishing of material digitized using imaging and OCR calls for a special delivery format capable of reconstructing original documents in a well-usable electronic form. We present a framework for the universal encoding of multilingual image-ontext documents, enabling retrieval systems to textsearch and highlight hits on original page images. A generalized format for representation of image-on-text allows for integration of different OCR engines and target format encoders. This framework’s current implementation encodes multilingual content into DjVu and PDF. Performance has been evaluated with focus on file size and shown that overhead of adding text layers is small compared to advantages and that output is comparable to other systems.

Youssef Eldakar, Noha Adly, Magdy Nagi

Real-time Traffic

ICDIM 2006 | Information Management | Multilingual Image-ontext Documents | Special Delivery Format | Well-usable Electronic Form |

claim paper

Post Info
More Details (n/a)

Added	11 Jun 2010
Updated	11 Jun 2010
Type	Conference
Year	2006
Where	ICDIM
Authors	Youssef Eldakar, Noha Adly, Magdy Nagi

Comments (0)

Sciweavers

A Framework for the Encoding of Multilayered Documents

ICDIM 2006 | Information Management | Multilingual Image-ontext Documents | Special Delivery Format | Well-usable Electronic Form |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers