Sciweavers

13 search results - page 2 / 3
» Retrieving poorly degraded OCR documents
Sort
View
ICDAR
2003
IEEE
14 years 22 days ago
A Case Restoration Approach to Named Entity Tagging in Degraded Documents
This paper describes a novel approach to named entity (NE) tagging on degraded documents. NE tagging is the process of identifying salient text strings in unstructured text, corre...
Rohini K. Srihari, Cheng Niu, Wei Li, Jihong Ding
ICDAR
2009
IEEE
14 years 2 months ago
Keyword Spotting in Document Images through Word Shape Coding
With large databases of document images available, a method for users to find keywords in documents will be useful. One approach is to perform Optical Character Recognition (OCR) ...
Shuyong Bai, Linlin Li, Chew Lim Tan
ICDAR
2011
IEEE
12 years 7 months ago
BLSTM Neural Network Based Word Retrieval for Hindi Documents
—Retrieval from Hindi document image collections is a challenging task. This is partly due to the complexity of the script, which has more than 800 unique ligatures. In addition,...
Raman Jain, Volkmar Frinken, C. V. Jawahar, Raghav...
ICPR
2006
IEEE
14 years 8 months ago
CAPTCHA Challenge Tradeoffs: Familiarity of Strings versus Degradation of Images
It is a well documented fact that, for human readers, familiar text is more legible than unfamiliar text. Current-generation computer vision systems also are able to exploit some ...
Jon Louis Bentley, Sui-Yu Wang
CIKM
2001
Springer
13 years 12 months ago
Improved String Matching Under Noisy Channel Conditions
Many document-based applications, including popular Web browsers, email viewers, and word processors, have a ‘Find on this Page’ feature that allows a user to find every occur...
Kevyn Collins-Thompson, Charles Schweizer, Susan T...