Sciweavers

SAC
2005
ACM

Performance evaluation for text processing of noisy inputs

We investigate the problem of evaluating the performance of text processing algorithms on inputs that contain errors as a result of optical character recognition. A new hierarchical paradigm is proposed based on approximate string matching, allowing each stage in the processing pipeline to be tested, the error effects analyzed, and possible solutions suggested.

Categories and Subject Descriptors: I.7.5 [Document and Text Processing]: Document Capture - document analysis

General Terms: algorithms, measurement, performance

Keywords: performance evaluation, optical character recognition, sentence boundary detection, tokenization, part-of-speech tagging
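
To make the role of approximate string matching concrete, here is a minimal sketch of how OCR output could be compared with a clean reference at two levels of a pipeline (characters, then tokens). It is not the paper's method: the listing gives no algorithmic details, so the use of plain Levenshtein distance, the whitespace tokenization, and the names `edit_distance` and `error_rate` are all assumptions made for illustration.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or token lists).

    Illustrative helper only; the paper's actual matching scheme is not
    described in this listing.
    """
    prev = list(range(len(hyp) + 1))  # distances from the empty ref prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty hyp prefix
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,           # delete r from ref
                            curr[j - 1] + 1,       # insert h into ref
                            prev[j - 1] + cost))   # match or substitute
        prev = curr
    return prev[-1]


def error_rate(ref, hyp):
    """Edit distance normalized by the reference length (hypothetical metric)."""
    if len(ref) == 0:
        return 0.0 if len(hyp) == 0 else 1.0
    return edit_distance(ref, hyp) / len(ref)


# Made-up example: a clean sentence and a version with two OCR-style errors.
clean = "The quick brown fox."
noisy = "Tbe qu1ck brown fox."

# Character level: how many character edits separate the two strings.
char_errors = edit_distance(clean, noisy)

# Token level: whitespace tokenization is an assumption; a real pipeline
# would compare the output of its own tokenizer against the reference.
token_errors = edit_distance(clean.split(), noisy.split())
token_error_rate = error_rate(clean.split(), noisy.split())
```

Because the same function accepts strings or lists, the comparison could be repeated on the output of later stages (sentence boundaries, part-of-speech tags), which is one way to read the "hierarchical" evaluation the abstract mentions.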
Daniel P. Lopresti
Added: 26 Jun 2010
Updated: 26 Jun 2010
Type: Conference
Year: 2005
Where: SAC
Authors: Daniel P. Lopresti