Abstract. This paper presents a system for retrieval of relevant documents from large document image collections. We achieve effective search and retrieval from a large collection ...
A. Balasubramanian, Million Meshesha, C. V. Jawaha...
There is a growing need to access historical Ottoman documents stored in large archives and therefore managing tools for automatic searching, indexing and transcription of these d...
Semantic analysis of a document collection can be viewed as an unsupervised clustering of the constituent words and documents around hidden or latent concepts. This has shown to i...
We developed a prototype for integrated retrieval and aggregation of diverse information contained in scanned paper documents. Such complex document information processing combine...
Shlomo Argamon, Gady Agam, Ophir Frieder, David A....
—A method for locating mathematical expressions in document images without the use of optical character recognition is presented. An index of document regions is produced from re...