In many topic identification applications, supervised training labels are indirectly related to the semantic content of the documents being classified. For example, many topical...
In this paper, we propose a preference framework for information retrieval in which the user and the system administrator are enabled to express preference annotations on search ke...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly, and the need for flexible, sophisticated document manipulation tools is growi...
Floriana Esposito, Stefano Ferilli, Teresa Maria A...
Developing better systems for document image analysis requires understanding errors, their sources, and their effects. The interactions between various processing steps are comple...
Many approaches to unsupervised morphology acquisition incorporate the frequency of character sequences with respect to each other to identify word stems and affixes. This typical...