There is an approach of annotation extraction from printed documents in which annotations are extracted by comparing the image of an annotated document and its original document i...
This paper introduces a framework for clarifying and formalizing the duplicate document detection problem. Four distinct models are presented, each with a corresponding algorithm ...
To distinguish image features from high level noises, we propose to exploit the spatial correlations between wavelet coefficients by replacing the thresholding process with a dif...
Degraded documents are frequently obtained in various situations. Examples of degraded document collections include historical document depositories, document obtained in legal an...
Layout analysis is a fundamental step in automatic document processing. Many different techniques have been proposed in literature to perform this task. These are broadly divided ...