Finding Images and Line-Drawings in Document-Scanning Systems

15 years 8 months ago

Download www.mangolassi.org

The system presented in this paper finds images and line-drawings in scanned pages; it is a crucial processing step in the creation of a large-scale system to detect and index images found in books and historic documents. Within the scanned pages that contain both text and images, the images are found through the use of SIFT-based local-features applied to the complete scanned-page. This is followed by a novel learning system to categorize the found SIFT features into either text or image. The discrimination is based on using multiple classifiers trained via AdaBoost. Through the use of this system, we improve image detection by finding more line-drawings, graphics, and photographs, as well as by reducing the number of spurious detections due to misclassified text, discolorations, and scanning artifacts.

Shumeet Baluja, Michele Covell

Real-time Traffic

Crucial Processing Step | Document Analysis | ICDAR 2009 | Index Images | Paper Finds Images |

claim paper

Post Info
More Details (n/a)

Added	21 May 2010
Updated	21 May 2010
Type	Conference
Year	2009
Where	ICDAR
Authors	Shumeet Baluja, Michele Covell

Comments (0)

Sciweavers

Finding Images and Line-Drawings in Document-Scanning Systems

Crucial Processing Step | Document Analysis | ICDAR 2009 | Index Images | Paper Finds Images |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers