In this paper, we present a new approach to extracting the target text line from a document image captured by a pen scanner. Given the binary image, a set of possible text lines a...
In this paper, we address the problem of extracting data records and their attributes from unstructured biomedical full text. There has been little effort reported on this in the ...
This position paper argues for an interactive approach to text understanding. The proposed model extends an existing semantics-based text authoring system by using the input text ...
Scientific texts domain keyword is one of the basic elements of the text high-level semantics acquisition, domain ontology building and the knowledge representation in semantic gr...
Xiangfeng Luo, Ning Fang, Weimin Xu, Sheng Yu, Kai...
Machine-generated documents containing semi-structured text are rapidly forming the bulk of data being stored in an organisation. Given a feature-based representation of such data,...