Abstract. In this paper we propose a new approach to improve electronic editions of literary corpus, providing an efficient estimation of manuscripts pages structure. In any handwr...
PDF became a very common format for exchanging printable documents. Further, it can be easily generated from the major documents formats, which make a huge number of PDF documents...
In the last years the user information seeking process on the Web has shifted from document search to object search. Hence, the answers provided by Web search engines cannot consis...
In this paper we present a general framework for document production that covers generic document model needs and adaptation needs. We define a multimedia document model called Mad...
Structural information about a document is essential for structured query processing, indexing, and retrieval. A document page can be partitioned into a hierarchy of homogeneous r...