Sciweavers

47 search results - page 3 / 10
» Text Degradations and OCR Training
Sort
View
ANLP
1994
134views more  ANLP 1994»
13 years 9 months ago
Degraded Text Recognition Using Word Collocation and Visual Inter-Word Constraints
Given a noisy text page, a word recognizer can generate a set of candidates for each word image. A relaxation algorithm was proposed previously by the authors that uses word collo...
Tao Hong, Jonathan J. Hull
EMNLP
2010
13 years 5 months ago
Evaluating Models of Latent Document Semantics in the Presence of OCR Errors
Models of latent document semantics such as the mixture of multinomials model and Latent Dirichlet Allocation have received substantial attention for their ability to discover top...
Daniel David Walker, William B. Lund, Eric K. Ring...
IJDAR
2000
60views more  IJDAR 2000»
13 years 7 months ago
Integrated text and line-art extraction from a topographic map
Our proposed approach to text and line-art extraction requires accurately locating a text-string box and identifying external line vectors incident on the box. The results of extra...
Luyang Li, George Nagy, Ashok Samal, Sharad C. Set...
ICDAR
2003
IEEE
14 years 1 months ago
Automatic Feature Selection with Applications to Script Identification of Degraded Documents
Current approaches to script identification rely on hand-selected features and often require processing a significant part of the document to achieve reliable identification. We p...
Vitaly Ablavsky, Mark R. Stevens
CVPR
2004
IEEE
14 years 9 months ago
Detecting and Reading Text in Natural Scenes
This paper gives an algorithm for detecting and reading text in natural images. The algorithm is intended for use by blind and visually impaired subjects walking through city scen...
Xiangrong Chen, Alan L. Yuille