—In this paper, we propose a novel method for extracting handwritten characters from multi-language document images, which may contain various types of characters, e.g. Chinese, ...
Yonghong Song, Guilin Xiao, Yuanlin Zhang, Lei Yan...
Over the past decade, multiple-instance learning (MIL)
has been successfully utilized to model the localized
content-based image retrieval (CBIR) problem, in which a
bag corresp...
Wu-Jun Li (Hong Kong University of Science and Tec...
The problem of writer identification in a multiscript environment is attempted using a twodimensional (2D) autoregressive (AR) modelling technique. Each writer is represented by a...
Content-only retrieval of XML documents deals with the problem of locating the smallest XML elements that satisfy the query. In this paper, we investigate the application of a spec...
Current approaches to script identification rely on hand-selected features and often require processing a significant part of the document to achieve reliable identification. We p...