XML databases often contain documents comprising structured text. Therefore, it is important to integrate "information retrieval style" query evaluation, which is well-s...
Although detecting text lines in machine printed documents is typically considered a solved problem, it is still a challenge to segment handwritten text lines in the general sense...
In this article, we propose a method of characterization of pictures of old documents based on a texture approach. This characterization is carried out with the help of a multires...
We present a static index pruning method, to be used in ad-hoc document retrieval tasks, that follows a documentcentric approach to decide whether a posting for a given term shoul...
This paper presents a novel block-based segmentation and adaptive coding(BSAC) algorithm for visually lossless compression of scanned documents that contain not only photographic ...