DAS
2008
Springer

Towards Whole-Book Recognition

We describe experimental results for unsupervised recognition of the textual contents of book-images using fully automatic mutual-entropy-based model adaptation. Each experiment starts with approximate iconic and linguistic models--derived from (generally errorful) OCR results and (generally incomplete) dictionaries--and then runs a fully automatic adaptation algorithm which, guided entirely by evidence internal to the test set, attempts to correct the models for improved accuracy. The iconic model describes image formation and determines the behavior of a character-image classifier. The linguistic model describes word-occurrence probabilities. Our adaptation algorithm detects disagreements between the models by analyzing mutual entropy between (1) the a posteriori probability distribution of character classes (the recognition results from image classification alone), and (2) the a posteriori probability distribution of word classes (the recognition results from image classification c...
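The disagreement test described in the abstract compares two posterior distributions for the same image: one from character classification alone, one from word-level recognition. The listing truncates the abstract before the measure is defined, so the sketch below is only an illustration under stated assumptions: it substitutes Kullback-Leibler divergence for the paper's "mutual entropy", marginalizes a word posterior down to a single character position, and uses invented probabilities. The function names `kl_divergence` and `marginalize_words` are hypothetical and do not come from the paper.

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """Return KL(p || q) in bits for two discrete distributions.

    p and q are dicts mapping class labels to probabilities.
    ASSUMPTION: KL divergence stands in for the paper's "mutual entropy",
    whose exact definition is not given in the truncated abstract.
    """
    total = 0.0
    for label, p_prob in p.items():
        if p_prob > 0.0:
            # Floor q to avoid division by zero for classes q never predicts.
            q_prob = max(q.get(label, 0.0), eps)
            total += p_prob * math.log2(p_prob / q_prob)
    return total


def marginalize_words(word_posterior, position):
    """Collapse a posterior over words into a posterior over the
    character at one position (all words must be long enough)."""
    char_posterior = {}
    for word, prob in word_posterior.items():
        ch = word[position]
        char_posterior[ch] = char_posterior.get(ch, 0.0) + prob
    return char_posterior


# Hypothetical posteriors for the first character of one word image.
char_post = {"c": 0.6, "e": 0.4}      # from the iconic model alone
word_post = {"cat": 0.1, "eat": 0.9}  # from the word-level (linguistic) evidence

word_char = marginalize_words(word_post, 0)
disagreement = kl_divergence(char_post, word_char)
print(f"disagreement at position 0: {disagreement:.3f} bits")
```

A large value flags a position where the two models disagree and where one of them may need correction; a value of zero means the two posteriors are identical. How the paper's algorithm decides which model to adjust is not covered by this sketch.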
Pingping Xiu, Henry S. Baird
Added: 19 Oct 2010
Updated: 19 Oct 2010
Type: Conference
Year: 2008
Where: DAS
Authors: Pingping Xiu, Henry S. Baird