Sciweavers

ICDAR
2011
IEEE
12 years 11 months ago
Character n-Gram Spotting in Document Images
—In this paper, we present a novel approach to search and retrieve from document image collections, without explicit recognition. Existing recognition-free approaches such as wor...
M. Sudha Praveen, K. Pramod Sankar, C. V. Jawahar
ICDAR
2011
IEEE
12 years 11 months ago
BLSTM Neural Network Based Word Retrieval for Hindi Documents
—Retrieval from Hindi document image collections is a challenging task. This is partly due to the complexity of the script, which has more than 800 unique ligatures. In addition,...
Raman Jain, Volkmar Frinken, C. V. Jawahar, Raghav...
IJDAR
2008
136views more  IJDAR 2008»
13 years 11 months ago
Matching word images for content-based retrieval from printed document images
As large quantity of document images is getting archived by the digital libraries, there is a need for an efficient search strategies to make them available as per users informatio...
Million Meshesha, C. V. Jawahar
ANLP
1994
134views more  ANLP 1994»
14 years 23 days ago
Degraded Text Recognition Using Word Collocation and Visual Inter-Word Constraints
Given a noisy text page, a word recognizer can generate a set of candidates for each word image. A relaxation algorithm was proposed previously by the authors that uses word collo...
Tao Hong, Jonathan J. Hull
DAS
2008
Springer
14 years 1 months ago
A Comparison of Clustering Methods for Word Image Indexing
In this paper we explore the effectiveness of three clustering methods used to perform word image indexing. The three methods are: the Self-Organazing Map (SOM), the Growing Hiera...
Simone Marinai, Emanuele Marino, Giovanni Soda
ICDAR
1995
IEEE
14 years 3 months ago
Visual inter-word relations and their use in OCR postprocessing
A technique is presented that uses visual relationships between word images in a document to improve the recognition of the text it contains. This technique takes advantage of the...
Tao Hong, Jonathan J. Hull
ACCV
2007
Springer
14 years 3 months ago
Efficient Search in Document Image Collections
This paper presents an efficient indexing and retrieval scheme for searching in document image databases. In many non-European languages, optical character recognizers are not very...
Anand Kumar 0002, C. V. Jawahar, R. Manmatha
DAS
2010
Springer
14 years 3 months ago
Towards more effective distance functions for word image matching
Matching word images has many applications in document recognition and retrieval systems. Dynamic Time Warping (DTW) is popularly used to estimate the similarity between word imag...
Raman Jain, C. V. Jawahar
ICPR
2004
IEEE
15 years 16 days ago
Italic Font Recognition Using Stroke Pattern Analysis on Wavelet Decomposed Word Images
This paper describes an italic font recognition method using stroke pattern analysis on wavelet decomposed word images. The word images are extracted from scanned text documents c...
Chew Lim Tan, Li Zhang, Yue Lu