Sciweavers

PAMI
2002

Lexicon-Driven Segmentation and Recognition of Handwritten Character Strings for Japanese Address Reading

13 years 11 months ago
Lexicon-Driven Segmentation and Recognition of Handwritten Character Strings for Japanese Address Reading
This paper describes a handwritten character string recognition system for Japanese mail address reading on very large vocabulary. The address phrases are recognized as a whole because there is no extra space between words. The lexicon contains 111,349 address phrases, which are stored in a trie structure. In recognition, the text line image is matched with the lexicon entries (phrases) to obtain reliable segmentation and retrieve valid address phrases. In this paper, we first introduce some effective techniques for text line image preprocessing and presegmentation. In presegmentation, the text line image is separated into primitive segments by connected component analysis and touching pattern splitting based on contour shape analysis. In lexicon matching, consecutive segments are dynamically combined into candidate character patterns. An accurate character classifier is embedded in lexicon matching to select characters matched with a candidate pattern from a dynamic category set. A be...
Cheng-Lin Liu, Masashi Koga, Hiromichi Fujisawa
Added 23 Dec 2010
Updated 23 Dec 2010
Type Journal
Year 2002
Where PAMI
Authors Cheng-Lin Liu, Masashi Koga, Hiromichi Fujisawa
Comments (0)