Sciweavers

CVPR
2012
IEEE
12 years 2 months ago
Enhanced continuous sign language recognition using PCA and neural network features
In this work a Gaussian Hidden Markov Model (GHMM) based automatic sign language recognition system is built on the SIGNUM database. The system is trained on appearance-based feat...
Yannick L. Gweth, Christian Plahl, Hermann Ney
JMLR
2012
12 years 2 months ago
Bounding the Probability of Error for High Precision Optical Character Recognition
We consider a model for which it is important, early in processing, to estimate some variables with high precision, but perhaps at relatively low recall. If some variables can be ...
Gary B. Huang, Andrew Kae, Carl Doersch, Erik G. L...
ICDAR
2011
IEEE
12 years 11 months ago
Math Spotting: Retrieving Math in Technical Documents Using Handwritten Query Images
—A method for locating mathematical expressions in document images without the use of optical character recognition is presented. An index of document regions is produced from re...
Richard Zanibbi, Li Yu
ICDAR
2011
IEEE
12 years 11 months ago
Character Enhancement for Historical Newspapers Printed Using Hot Metal Typesetting
—We propose a new method for an effective removal of the printing artifacts occurring in historical newspapers which are caused by problems in the hot metal typesetting, a widely...
Iuliu Vasile Konya, Stefan Eickeler, Christoph Sei...
ICDAR
2011
IEEE
12 years 11 months ago
A Novel Italic Detection and Rectification Method for Chinese Advertising Images
—The italic detection and slant rectification is a key step of optical character recognition (OCR). In this paper, a novel method is proposed to detect and rectify italic charact...
Jie Liu, Heping Li, Shuwu Zhang, Wei Liang
CIKM
2011
Springer
12 years 11 months ago
Partial duplicate detection for large book collections
A framework is presented for discovering partial duplicates in large collections of scanned books with optical character recognition (OCR) errors. Each book in the collection is r...
Ismet Zeki Yalniz, Ethem F. Can, R. Manmatha
JMM2
2007
100views more  JMM2 2007»
13 years 11 months ago
On Separation of English Numerals from Multilingual Document Images
— For Optical Character Recognition (OCR) of bilingual or multilingual document containing text words in regional language and numerals in English, it is necessary to identify di...
Basanna V. Dhandra, Mallikarjun Hangarge
IVC
2007
104views more  IVC 2007»
13 years 11 months ago
Text segmentation in color images using tensor voting
In natural scene, text elements are corrupted by many types of noise, such as streaks, highlights, or cracks. These effects make the clean and automatic segmentation very difficu...
Jaeguyn Lim, Jonghyun Park, Gérard G. Medio...
SIGIR
2008
ACM
13 years 11 months ago
Optical character recognition errors and their effects on natural language processing
Errors are unavoidable in advanced computer vision applications such as optical character recognition, and the noise induced by these errors presents a serious challenge to downstr...
Daniel P. Lopresti
ANLP
2000
169views more  ANLP 2000»
14 years 1 months ago
Named Entity Extraction from Noisy Input: Speech and OCR
In this paper, we analyze the performance of name finding in the context of a variety of automatic speech recognition (ASR) systems and in the context of one optical character rec...
David R. H. Miller, Sean Boisen, Richard M. Schwar...