Sciweavers

ICPR
2002
IEEE

Word Segmentation of Printed Text Lines Based on Gap Clustering and Special Symbol Detection

15 years 1 months ago
Word Segmentation of Printed Text Lines Based on Gap Clustering and Special Symbol Detection
This paper proposes a word segmentation method for machine-printed text lines. It utilizes gaps and special symbols as delimiters between words. A gap clustering technique is used to identify the gaps between words regardless of the gap-size variations among different document images. Next a special symbol detection technique is applied to find two types of special symbols lying between words. An experiment with 1,675 text lines in 100 different English and Korean documents shows that the proposed method achieves a high accuracy of word segmentation.
Soo-Hyung Kim, Chang Bu Jeong, Hee K. Kwag, Ching
Added 09 Nov 2009
Updated 09 Nov 2009
Type Conference
Year 2002
Where ICPR
Authors Soo-Hyung Kim, Chang Bu Jeong, Hee K. Kwag, Ching Y. Suen
Comments (0)