Sciweavers

47 search results - page 2 / 10
» Text Degradations and OCR Training
Sort
View
ECIR
2009
Springer
14 years 4 months ago
Revisiting N-Gram Based Models for Retrieval in Degraded Large Collections
The traditional retrieval models based on term matching are not effective in collections of degraded documents (output of OCR or ASR systems for instance). This paper presents a n...
Javier Parapar, Ana Freire, Alvaro Barreiro
ICDAR
2003
IEEE
14 years 1 months ago
A Case Restoration Approach to Named Entity Tagging in Degraded Documents
This paper describes a novel approach to named entity (NE) tagging on degraded documents. NE tagging is the process of identifying salient text strings in unstructured text, corre...
Rohini K. Srihari, Cheng Niu, Wei Li, Jihong Ding
ERCIMDL
2009
Springer
117views Education» more  ERCIMDL 2009»
14 years 2 months ago
Improving OCR Accuracy for Classical Critical Editions
This paper describes a work-flow designed to populate a digital library of ancient Greek critical editions with highly accurate OCR scanned text. While the most recently available...
Federico Boschetti, Matteo Romanello, Alison Babeu...
ICDAR
2009
IEEE
13 years 5 months ago
Learning on the Fly: Font-Free Approaches to Difficult OCR Problems
Despite ubiquitous claims that optical character recognition (OCR) is a "solved problem," many categories of documents continue to break modern OCR software such as docu...
Andrew Kae, Erik G. Learned-Miller
ICPR
2006
IEEE
14 years 8 months ago
CAPTCHA Challenge Tradeoffs: Familiarity of Strings versus Degradation of Images
It is a well documented fact that, for human readers, familiar text is more legible than unfamiliar text. Current-generation computer vision systems also are able to exploit some ...
Jon Louis Bentley, Sui-Yu Wang