This paper presents an approach for converting bitmap images of text glyphs into a vector format suitable for embedding in XML representations of digitized documents. T...
Stefan Pletschacher, Marcel Eckert, Arved C. H&uum...
Electronic publishing of material digitized using imaging and OCR calls for a special delivery format capable of reconstructing original documents in a well-usable electronic form...
In order to reduce the rejection rate of our automatic reading system, we propose to pre-classify the business documents by introducing an Automatic Recognition of Documents stage...
Optical Character Recognition (OCR) has been an active subject of research since the early days of computers. Despite the age of the subject, it remains one of the most challengin...
We report on the creation of a database composed of images of Arabic printed words. The purpose of this database is the large-scale benchmarking of open-vocabulary, multi-font, mul...
Fouad Slimane, Rolf Ingold, Slim Kanoun, Adel M. A...