– This paper describes a new approach to recognize touching numeral strings. Currently most methods for numeral string recognition require segmenting the string image into separate numerals. As a result, the recognition system heavily depends on the reliability of the segmentation module. This study explores the holistic strategy directly on the string images without segmentation. It builds the novel classifier by combining binary classifiers based on Data-driven Error Correcting Output Coding (DECOC). The dimensions of input images are reduced using principal components analysis. Support vector machines are used as base learners. Experiments on NIST SD19 touching numeral pairs confirm that DECOC can achieve favorable performance compared with other multi-class holistic classifiers. The method provides the flexibility of controlling the computational complexity versus accuracy. We also discuss an implementation suitable for distributing computing by decomposing the ensemble into...
Jie Zhou, Ching Y. Suen