Sciweavers

PR
2016

Open-vocabulary recognition of machine-printed Arabic text using hidden Markov models

8 years 7 months ago
Open-vocabulary recognition of machine-printed Arabic text using hidden Markov models
—In this paper, we present multi-font printed Arabic text recognition using hidden Markov models (HMMs). We propose a novel approach to the sliding window technique for feature extraction. The size and position of the cells of the sliding window adapt to the writing line of Arabic text and ink-pixel distributions. We employ a two-step approach for mixed-font text recognition, in which the input text line image is associated with the closest known font in the first step, using simple and effective features for font identification. The text line is subsequently recognized by the recognizer that was trained for the particular font in the next step. This approach proves to be more effective than text recognition, which employs a recognizer trained on samples from multiple fonts. We also present a framework for the recognition of unseen fonts, which employs font association and HMM adaptation techniques. Experiments were conducted using two separate databases of printed Arabic text to dem...
Irfan Ahmad, Sabri A. Mahmoud, Gernot A. Fink
Added 09 Apr 2016
Updated 09 Apr 2016
Type Journal
Year 2016
Where PR
Authors Irfan Ahmad, Sabri A. Mahmoud, Gernot A. Fink
Comments (0)