Text - Image Separation in Devanagari Documents

16 years 2 days ago

Download www.cse.salford.ac.uk

In this paper we present a top-down, projection-proﬁle based algorithm to separate text blocks from image blocks in a Devanagari document. We use a distinctive feature of Devanagari text, called Shirorekha (Header Line) to analyze the pattern produced by Devanagari text in the horizontal proﬁle. The horizontal proﬁle corresponding to a text block possesses certain regularity in frequency, orientation and shows spatial cohesion. The algorithm uses these features to identify text blocks in a document image containing both text and graphics.

Swapnil Khedekar, Vemulapati Ramanaprasad, Srirang

Real-time Traffic

Devanagari Text | Document Analysis | ICDAR 2003 | Separate Text Blocks | Text Blocks |

claim paper

» Text Separation from Mixed Documents Using a TreeStructured Classifier

» Touching Character Separation in Chinese Handwriting Using VisibilityBased Foreground Anal...

» Learning to Separate Text Content and Style for Classification

» Document Understanding System Using Stochastic ContextFree Grammars

» TextGraphics Segmentation in Architectural Floor Plans

» Hybrid Indexing and Seamless Ranking of Spatial and Textual Features of Web Documents

» On Document Classification with SelfOrganising Maps

» Resolving Ambiguities in Toponym Recognition in Cartographic Maps

» Semantic enrichment of text representation with wikipedia for text classification

Post Info
More Details (n/a)

Added	04 Jul 2010
Updated	04 Jul 2010
Type	Conference
Year	2003
Where	ICDAR
Authors	Swapnil Khedekar, Vemulapati Ramanaprasad, Srirangaraj Setlur, Venu Govindaraju

Comments (0)

Sciweavers

Text - Image Separation in Devanagari Documents

Devanagari Text | Document Analysis | ICDAR 2003 | Separate Text Blocks | Text Blocks |

Explore & Download

Productivity Tools

Sciweavers