A Self-Training Learning Document Binarization Framework

15 years 6 months ago

Download www.comp.nus.edu.sg

—Document Image Binarization techniques have been studied for many years, and many practical binarization techniques have been developed and applied successfully on commercial document analysis systems. However, the current state-of-the-art methods, fail to produce good binarization results for many badly degraded document images. In this paper, we propose a self-training learning framework for document image binarization. Based on reported binarization methods, the proposed framework ﬁrst divides document image pixels into three categories, namely, foreground pixels, background pixels and uncertain pixels. A classiﬁer is then trained by learning from the document image pixels in the foreground and background categories. Finally, the uncertain pixels are classiﬁed using the learned pixel classiﬁer. Extensive experiments have been conducted over the dataset that is used in the recent Document Image Binarization Contest(DIBCO) 2009. Experimental results show that our proposed f...

Bolan Su, Shijian Lu, Chew Lim Tan

Real-time Traffic

Computer Vision | Document Image | ICPR 2010 | Image Binarization | Pixel |

claim paper

Post Info
More Details (n/a)

Added	07 Dec 2010
Updated	07 Dec 2010
Type	Conference
Year	2010
Where	ICPR
Authors	Bolan Su, Shijian Lu, Chew Lim Tan

Comments (0)

Sciweavers

A Self-Training Learning Document Binarization Framework

Computer Vision | Document Image | ICPR 2010 | Image Binarization | Pixel |

Explore & Download

Productivity Tools

Sciweavers