This paper deals about text extraction from heterogeneous documents for categorizing documents and indexing tasks. The purpose of this work is to find similar text regions basing on their fonts. First text regions are extracted, and then font matching is performed using fractal descriptors. Experiments are done for both maps and ancient documents.
Badreddine Khelifi, Nizar Zaghden, Adel M. Alimi,