Many documents are available to a computer only as images from paper. However, most natural language processing systems expect their input as character-coded text, which may be di...
For Optical Character Recognition (OCR) of bilingual or multilingual documents containing text words in a regional language and numerals in English, it is necessary to identify di...
This paper describes a new versatile algorithm for correcting nonlinear distortions, such as curvature of book pages, in camera-based document processing. We introduce the idea of...
Despite advances in the archiving of digital video, we are still unable to efficiently search and retrieve the portions that interest us. Video indexing by shot segmentation has b...
Sameer Antani, David J. Crandall, Rangachar Kastur...
Skew detection via principal components is proposed as an effective method for images which contain parts other than text. It is shown that the negative of the image leads to much mor...
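
As a rough illustration of the general idea of skew detection via principal components (not the method of the paper above, whose details are not given here), the following minimal sketch treats the foreground pixel coordinates of a binary image as a 2-D point cloud and takes the direction of largest variance as the skew estimate. The function names `estimate_skew_degrees` and `make_skewed_stripe`, the synthetic test image, and the convention that foreground pixels are nonzero are all assumptions made for this example.

```python
import numpy as np


def estimate_skew_degrees(binary):
    """Estimate the dominant orientation of foreground pixels via PCA.

    `binary` is a 2-D array in which nonzero entries are foreground
    (an assumption of this sketch). The angle is measured in image
    coordinates (x to the right, y downward) and folded into (-90, 90].
    """
    ys, xs = np.nonzero(binary)
    if xs.size < 2:
        raise ValueError("need at least two foreground pixels")

    # Build a 2 x N matrix of centered (x, y) coordinates.
    coords = np.stack([xs, ys]).astype(float)
    coords -= coords.mean(axis=1, keepdims=True)

    # The 2 x 2 covariance matrix is symmetric, so eigh applies.
    cov = coords @ coords.T / coords.shape[1]
    eigvals, eigvecs = np.linalg.eigh(cov)

    # The eigenvector with the largest eigenvalue is the principal axis.
    vx, vy = eigvecs[:, np.argmax(eigvals)]
    angle = np.degrees(np.arctan2(vy, vx))

    # An eigenvector's sign is arbitrary, so fold the angle into (-90, 90].
    if angle > 90.0:
        angle -= 180.0
    elif angle <= -90.0:
        angle += 180.0
    return angle


def make_skewed_stripe(angle_deg, width=200, height=120):
    """Hypothetical helper: draw a one-pixel-thick stripe at a known angle."""
    img = np.zeros((height, width), dtype=np.uint8)
    xs = np.arange(width)
    ys = np.rint(50 + xs * np.tan(np.radians(angle_deg))).astype(int)
    keep = (ys >= 0) & (ys < height)
    img[ys[keep], xs[keep]] = 1
    return img


if __name__ == "__main__":
    image = make_skewed_stripe(10.0)
    print(f"estimated skew: {estimate_skew_degrees(image):.2f} degrees")
```

This sketch only works when the foreground is elongated along the text direction. A page whose text block is taller than it is wide, or one dominated by non-text regions, would need additional preprocessing before the principal axis could be read as the skew.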