
SAC 2005, ACM


Performance evaluation for text processing of noisy inputs



We investigate the problem of evaluating the performance of text processing algorithms on inputs that contain errors as a result of optical character recognition. A new hierarchical paradigm is proposed based on approximate string matching, allowing each stage in the processing pipeline to be tested, the error effects analyzed, and possible solutions suggested.

Categories and Subject Descriptors: I.7.5 [Document and Text Processing]: Document Capture - document analysis

General Terms: algorithms, measurement, performance

Keywords: performance evaluation, optical character recognition, sentence boundary detection, tokenization, part-of-speech tagging
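The abstract names approximate string matching as the basis of the evaluation paradigm but gives no details here. As a rough illustration only, the sketch below computes the classic Levenshtein edit distance between a ground-truth string and a noisy OCR string and derives a character error rate from it. The function names `edit_distance` and `character_error_rate` are hypothetical, and this is not claimed to be the paper's hierarchical method, only a minimal instance of the underlying matching idea.

```python
def edit_distance(source: str, target: str) -> int:
    """Levenshtein distance via the standard dynamic program (illustrative sketch)."""
    # previous[j] holds the distance between the processed prefix of `source`
    # and the first j characters of `target`.
    previous = list(range(len(target) + 1))
    for i, s_char in enumerate(source, start=1):
        current = [i]  # deleting all i characters of the source prefix
        for j, t_char in enumerate(target, start=1):
            substitution = previous[j - 1] + (s_char != t_char)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


def character_error_rate(ground_truth: str, ocr_output: str) -> float:
    """Edit distance normalized by ground-truth length (an assumed metric)."""
    if not ground_truth:
        return 0.0 if not ocr_output else float("inf")
    return edit_distance(ground_truth, ocr_output) / len(ground_truth)


if __name__ == "__main__":
    # A typical OCR confusion: "rn" read as "m".
    print(edit_distance("modern", "modem"))
    print(character_error_rate("modern", "modem"))
```

The same distance could in principle be applied at coarser levels (tokens or sentences instead of characters) to compare each pipeline stage's output against its error-free counterpart, though how the paper structures that hierarchy is not stated in this listing.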

Daniel P. Lopresti



Post Info

Added	26 Jun 2010
Updated	26 Jun 2010
Type	Conference
Year	2005
Where	SAC
Authors	Daniel P. Lopresti
