Document registration is a problem where the image of a template document whose layout is known is registered with a test document image. Given the registration parameters, layout...
A major obstacle that decreases the performance of text classifiers is the extremely high dimensionality of text data. To reduce the dimension, a number of approaches based on rou...
In some domains, Information Extraction (IE) from texts requires syntactic and semantic parsing. This analysis is computationally expensive and IE is potentially noisy if it applie...
: Problem statement: Term extraction is one of the layers in the ontology development process which has the task to extract all the terms contained in the input document automatica...
The structure of the web is increasingly being used to improve organization, search, and analysis of information on the web. For example, Google uses the text in citing documents ...
Eric J. Glover, Kostas Tsioutsiouliklis, Steve Law...