Sciweavers

196 search results - page 5 / 40
» An OCR System for Printed Documents
Sort
View
ICDAR
1997
IEEE
13 years 11 months ago
Representing OCRed documents in HTML
ABSTRACT: OCR is an error-prone process. It is time-consuming and expensive to manually proofread OCR results. The errors remaining in OCRed texts can cause serious problems in rea...
Tao Hong, Sargur N. Srihari
ICDAR
2011
IEEE
12 years 7 months ago
Towards Searchable Digital Urdu Libraries - A Word Spotting Based Retrieval Approach
—Libraries in South Asia hold huge collections of valuable printed documents in Urdu and it is of interest to digitize these collections to make them more accessible. The unavail...
Ali Abidi, Imran Siddiqi, Khurram Khurshid
ICDAR
2005
IEEE
14 years 1 months ago
Text Degradations and OCR Training
Printing and scanning of text documents introduces degradations to the characters which can be modeled. Interestingly, certain combinations of the parameters that govern the degra...
Elisa H. Barney Smith, Tim L. Andersen
ICDAR
2009
IEEE
14 years 2 months ago
Robust Recognition of Documents by Fusing Results of Word Clusters
The word error rate of any optical character recognition system (OCR) is usually substantially below its component or character error rate. This is especially true of Indic langua...
Venkat Rasagna, Anand Kumar 0002, C. V. Jawahar, R...
DIAL
2004
IEEE
170views Image Analysis» more  DIAL 2004»
13 years 11 months ago
A General System for the Retrieval of Document Images from Digital Libraries
Large collections of scanned documents (books and journals) are now available in Digital Libraries. The most common method for retrieving relevant information from these collectio...
Simone Marinai, Emanuele Marino, Francesca Cesarin...