Sciweavers

ICDAR
2009
IEEE

Learning Rich Hidden Markov Models in Document Analysis: Table Location

14 years 7 months ago
Learning Rich Hidden Markov Models in Document Analysis: Table Location
Hidden Markov Models (HMM) are probabilistic graphical models for interdependent classification. In this paper we experiment with different ways of combining the components of an HMM for document analysis applications, in particular for finding tables in text. We show: a) how to integrate different document structure finders into the HMM; b) that transition probabilities should vary along the chain to embed general knowledge axioms of our field, c) some emission energies can be selectively ignored, and d) emission and transition probabilities can be weighed differently. We conclude these changes increase the expressiveness and usability of HMMs in our field.
Ana Costa e Silva
Added 21 May 2010
Updated 21 May 2010
Type Conference
Year 2009
Where ICDAR
Authors Ana Costa e Silva
Comments (0)