—Handwritten text line segmentation on real-world data presents significant challenges that cannot be overcome by any single technique. Given the diversity of approaches and the...
– We describe a method to extract content text from diverse Web pages by using the HTML document’s Text-to-Tag Ratio rather than specific HTML cues that may not be constant acr...
Skew estimation and page segmentation are the two closely related processing stages for document image analysis. Skew estimation needs proper page segmentation, especially for doc...
We present a spatially variant framework for correcting uneven illumination and color cast, problems commonly associated with digitized books. The core of our method is a color im...
In this article we present a novel fully automatic character segmentation for camera-based images. This is a top-down approach inspired by the human visual system: the high level ...