Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...
Text summarization solves the problem of extracting important information from huge amount of text data. There are various methods in the literature that aim to find out well-form...
Repetition of layout structure is prevalent in document images. In document design, such repetition conveys the underlying logical and functional structure of the data. For exampl...
In this paper we propose to define a measure of visual similarity to compare different pages in a corpus. This measure is based on the analysis of the visual layout saliency of th...
Text detection in video images has received increasing attention, particularly in scene text detection in video images, as it plays a vital role in video indexing and information ...
Palaiahnakote Shivakumara, Trung Quy Phan, Chew Li...