Two-stage Approach for Word-wise Script Identification

A two-stage approach for word-wise identification of English (Roman), Devnagari and Bengali (Bangla) scripts is proposed. The approach balances the trade-off between recognition accuracy and processing speed. The 1st stage identifies scripts at high speed, but with lower accuracy on noisy data. The more advanced 2nd stage processes only those samples that yield low recognition confidence in the 1st stage. In both stages, a rough character segmentation is performed and features are computed on the segmented character components. The 1st stage uses a 64-dimensional chain-code histogram feature, while the 2nd stage uses 400-dimensional gradient features. A word is finally assigned to a script by majority voting over the recognized character components of the word. Extensive experiments with various confidence scores were conducted and are reported here. The overall recognition accuracy and speed are remarkable. Correct classification of...
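The control flow described in the abstract (a fast first stage, a fallback to the second stage on low confidence, then a majority vote over character components) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the names `classify_component` and `classify_word`, the `(label, confidence)` classifier interface, the default threshold of 0.8, and the choice to apply the fallback per character component are all assumptions made for the example. Feature extraction (chain-code histogram, gradient features) and the classifiers themselves are left as caller-supplied functions.

```python
from collections import Counter
from typing import Any, Callable, Sequence, Tuple

# Assumed interface: a classifier maps one segmented character component
# to a (script_label, confidence) pair. Not taken from the paper.
Classifier = Callable[[Any], Tuple[str, float]]


def classify_component(component: Any, fast: Classifier,
                       accurate: Classifier, threshold: float) -> str:
    """Use the fast 1st-stage classifier; fall back to the 2nd stage
    only when the 1st-stage confidence is below the threshold."""
    label, confidence = fast(component)
    if confidence >= threshold:
        return label
    label, _ = accurate(component)
    return label


def classify_word(components: Sequence[Any], fast: Classifier,
                  accurate: Classifier, threshold: float = 0.8) -> str:
    """Assign a word to a script by majority vote over its components.

    Ties go to the label that was seen first (an arbitrary choice
    made for this sketch).
    """
    if not components:
        raise ValueError("word has no character components")
    votes = Counter(
        classify_component(c, fast, accurate, threshold) for c in components
    )
    return votes.most_common(1)[0][0]
```

Raising the threshold sends more components to the slower second stage, which is the accuracy-versus-speed knob the abstract refers to.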

Sukalpa Chanda, Srikanta Pal, Katrin Franke, Umapada Pal

Character Component | Document Analysis | ICDAR 2009 | Low Recognition Confidence | Recognition Accuracy

Post Info

Added	18 Feb 2011
Updated	18 Feb 2011
Type	Conference
Year	2009
Where	ICDAR
Authors	Sukalpa Chanda, Srikanta Pal, Katrin Franke, Umapada Pal

Sciweavers
