Towards Whole-Book Recognition

14 years 4 months ago

Download www.cse.lehigh.edu

We describe experimental results for unsupervised recognition of the textual contents of book-images using fully automatic mutual-entropy-based model adaptation. Each experiment starts with approximate iconic and linguistic models--derived from (generally errorful) OCR results and (generally incomplete) dictionaries--and then runs a fully automatic adaptation algorithm which, guided entirely by evidence internal to the test set, attempts to correct the models for improved accuracy. The iconic model describes image formation and determines the behavior of a character-image classifier. The linguistic model describes word-occurrence probabilities. Our adaptation algorithm detects disagreements between the models by analyzing mutual entropy between (1) the a posteriori probability distribution of character classes (the recognition results from image classification alone), and (2) the a posteriori probability distribution of word classes (the recognition results from image classification c...

Pingping Xiu, Henry S. Baird

Real-time Traffic

Adaptation Algorithm | DAS 2008 | Document Analysis | Passage Lengths | Posteriori Probability Distribution |

claim paper

Post Info
More Details (n/a)

Added	19 Oct 2010
Updated	19 Oct 2010
Type	Conference
Year	2008
Where	DAS
Authors	Pingping Xiu, Henry S. Baird

Comments (0)

Sciweavers

Towards Whole-Book Recognition

Adaptation Algorithm | DAS 2008 | Document Analysis | Passage Lengths | Posteriori Probability Distribution |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers