Language Model Integration for the Recognition of Handwritten Medieval Documents

15 years 4 months ago

Download www.cvc.uab.es

Building recognition systems for historical documents is a difficult task. Especially, when it comes to medieval scripts. The complexity is mainly affected by the poor quality and the small quantity of the data available. In this paper we apply an HMM based recognition system to medieval manuscripts from the 13th century written in Middle High German. The recognition system, which was originally developed for modern scripts, has been adapted to medieval scripts. Beside the data processing, one of the major challenges is to create a suitable language model. Because of the lack of appropriate independent text corpora for medieval languages, the language model has to be created on the base of a rather small number of manuscripts only. Due to the small size of the corpus, optimizing the language model parameters can quickly lead to the problem of overfitting. In this paper we describe a strategy to integrate all available information into the language model and to optimize the language mo...

Markus Wüthrich, Marcus Liwicki, Andreas Fisc

Real-time Traffic

Document Analysis | ICDAR 2009 | Language Model | Language Model Parameters | Medieval |

claim paper

» Generic scalespace process for handwriting documents analysis

» Handwritten Mail Classification Experiments with the Rimes Database

» Language Models for Handwritten Short Message Services

» Handling OutofVocabulary Words and Recognition Errors Based on Word Linguistic Context for...

» Toward affine recognition of handwritten mathematical characters

» Boosted decision trees for word recognition in handwritten document retrieval

» Integration of Statistical Models for Dictation of Document Translations in a MachineAided...

» A Hybrid Model for Recognition of Online Handwriting in Indian Scripts

Post Info
More Details (n/a)

Added	18 Feb 2011
Updated	18 Feb 2011
Type	Journal
Year	2009
Where	ICDAR
Authors	Markus Wüthrich, Marcus Liwicki, Andreas Fischer, Emanuel Indermühle, Horst Bunke, Gabriel Viehhauser, Michael Stolz

Comments (0)

Sciweavers

Language Model Integration for the Recognition of Handwritten Medieval Documents

Document Analysis | ICDAR 2009 | Language Model | Language Model Parameters | Medieval |

Explore & Download

Productivity Tools

Sciweavers