Sciweavers

» An OCR System for Printed Documents
NAACL
2003
13 years 8 months ago
A Generative Probabilistic OCR Model for NLP Applications
In this paper, we introduce a generative probabilistic optical character recognition (OCR) model that describes an end-to-end process in the noisy channel framework, progressing f...
Okan Kolak, William J. Byrne, Philip Resnik
ICDAR
2009
IEEE
13 years 5 months ago
Pre-Processing of Degraded Printed Documents by Non-local Means and Total Variation
We compare in this study two image restoration approaches for the pre-processing of printed documents: namely the Non-local Means filter and a total variation minimization approac...
Laurence Likforman-Sulem, Jérôme Darb...
ICDAR
2009
IEEE
14 years 2 months ago
Recognition of Degraded Handwritten Characters Using Local Features
The main problems of Optical Character Recognition (OCR) systems are solved if printed Latin text is considered. Since OCR systems are based upon binary images, their results are ...
Markus Diem, Robert Sablatnig
JCDL
2006
ACM
14 years 1 months ago
A hierarchical, HMM-based automatic evaluation of OCR accuracy for a digital library of books
A number of projects are creating searchable digital libraries of printed books. These include the Million Book Project, the Google Book project and similar efforts from Yahoo an...
Shaolei Feng, R. Manmatha
ICDAR
1999
IEEE
13 years 11 months ago
Multifont Classification using Typographical Attributes
This paper introduces a multifont classification scheme to help recognition of multifont and multisize characters. It uses typographical attributes such as ascenders, descenders a...
Min-Chul Jung, Yong-Chul Shin, Sargur N. Srihari