Logical entity recognition in heterogeneous collections of document page images remains a challenging problem since the performance of traditional supervised methods degrade drama...
In this paper we present an innovative two-stage adaptation approach for handwriting recognition that is based on clustering of similar pages in the training data. In our approach...
This paper presents preliminary results for document classification of ancient Hebrew manuscripts. The main goal is to discriminate between documents of different writing styles, ...
Itay Bar Yosef, Klara Kedem, Its'hak Dinstein, Mal...
Many text documents naturally have two kinds of labels. For example, we may label web pages from universities according to their categories, such as "student" or "fa...