In the traditional setting, text categorization is formulated as a concept learning problem where each instance is a single isolated document. However, this perspective is not appr...
Today's digital libraries increasingly include not only printed text but also scanned handwritten pages and other multimedia material. There are, however, few tools available...
Documents in many corpora, such as digital libraries and webpages, contain both content and link information. In a traditional topic model, which plays an important role in the uns...
The original SenseMaker interface for information exploration [2] used tables to present heterogeneous document descriptions. In contrast, printed bibliographies and World Wide We...
As ancient documents are being digitized, systems for retrieving documents or images can now be found in digital libraries. With regard to illustrations, the content-based image r...