We combine information from a language model and character image pattern matching to iteratively reduce ambiguity in document images. Combining word shape information and lists of similar bitmap patterns in a document at least partially resolves the character content without optical character recognition. We present the output in various ways suitable for human readers or for differing downstream processes.
A. Lawrence Spitz