Digipaper: A Versatile Color Document Image Representation

We describe a segmentation method and an associated file format for storing images of color documents. We separate each page of the document into three layers: the background (usually one or more photographic images), the text, and the color of the text. These layers have different properties, so it is desirable to compress each with a different method. The background layers are compressed using any method designed for photographic images, the text layers are compressed using a token-based representation, and the text color layers are compressed by augmenting the representation used for the text layers. We also describe an algorithm for segmenting images into these three layers. Together, the representation and algorithm can produce highly compressed document files that nonetheless retain excellent image quality.
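
To make the three-layer idea concrete, the following is a minimal sketch of how a page might be rebuilt from its layers once each has been decoded. It is not the DigiPaper file format or decoder: the function name `compose_page`, the per-pixel text color layer, and the list-of-lists image layout are all assumptions made for illustration, and no compression is performed.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each 0-255


def compose_page(background: List[List[Pixel]],
                 text_mask: List[List[int]],
                 text_color: List[List[Pixel]]) -> List[List[Pixel]]:
    """Rebuild a page from three decoded layers (hypothetical layout).

    Where the text mask is non-zero the pixel takes its value from the
    text color layer; everywhere else it takes the background pixel.
    """
    height = len(background)
    width = len(background[0]) if height else 0
    # All three layers are assumed to share the page dimensions.
    for layer in (text_mask, text_color):
        if len(layer) != height or any(len(row) != width for row in layer):
            raise ValueError("all three layers must have the same dimensions")
    return [
        [text_color[y][x] if text_mask[y][x] else background[y][x]
         for x in range(width)]
        for y in range(height)
    ]


# Tiny 2x2 example: white background, red text on the diagonal.
WHITE: Pixel = (255, 255, 255)
RED: Pixel = (200, 0, 0)
background = [[WHITE, WHITE], [WHITE, WHITE]]
text_mask = [[1, 0], [0, 1]]
text_color = [[RED, RED], [RED, RED]]
page = compose_page(background, text_mask, text_color)
```

Because the mask is binary and the color layer is typically constant over each glyph, those two layers can be stored far more compactly than the photographic background, which is the motivation for compressing each layer with a different method.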

Daniel P. Huttenlocher, Pedro F. Felzenszwalb, William Rucklidge

Background Layers | ICIP 1999 | Image Processing | Text Color Layers | Text Layers |

Added	25 Oct 2009
Updated	26 Oct 2009
Type	Conference
Year	1999
Where	ICIP
Authors	Daniel P. Huttenlocher, Pedro F. Felzenszwalb, William Rucklidge
