Sciweavers

ICDAR
2003
IEEE

Text - Image Separation in Devanagari Documents

14 years 5 months ago
Text - Image Separation in Devanagari Documents
In this paper we present a top-down, projection-profile based algorithm to separate text blocks from image blocks in a Devanagari document. We use a distinctive feature of Devanagari text, called Shirorekha (Header Line) to analyze the pattern produced by Devanagari text in the horizontal profile. The horizontal profile corresponding to a text block possesses certain regularity in frequency, orientation and shows spatial cohesion. The algorithm uses these features to identify text blocks in a document image containing both text and graphics.
Swapnil Khedekar, Vemulapati Ramanaprasad, Srirang
Added 04 Jul 2010
Updated 04 Jul 2010
Type Conference
Year 2003
Where ICDAR
Authors Swapnil Khedekar, Vemulapati Ramanaprasad, Srirangaraj Setlur, Venu Govindaraju
Comments (0)