Sciweavers

JCDL
2003
ACM

Correcting Broken Characters in the Recognition of Historical Printed Documents

14 years 5 months ago
Correcting Broken Characters in the Recognition of Historical Printed Documents
This paper presents a new technique for dealing with broken characters, one of the major challenges in the optical character recognition (OCR) of degraded historical printed documents. A technique based on graph combinatorics is used to rejoin the appropriate connected components. It has been applied to real data with successful results.
Michael Droettboom
Added 05 Jul 2010
Updated 05 Jul 2010
Type Conference
Year 2003
Where JCDL
Authors Michael Droettboom
Comments (0)