This paper examines several different approaches to exploiting structural information in semi-structured document categorization. The methods under consideration are designed for ...
Ordering principles of digital libraries expressed in ontologies may be highly heterogeneous even within a domain, and especially across different cultures. Automatic methods for mapp...
To circumvent prevalent text-based anti-spam filters, spammers have begun embedding the advertisement text in images. Analogously, proprietary information (such as source code) ma...
Hrishikesh Aradhye, Gregory K. Myers, James A. Her...
This paper presents the results of a pilot study on using automatic text categorization techniques in identifying online sexual predators. We report on our SVM and k-NN models. Ou...
In this paper, we discuss the implementation and performance of our bibliographic navigator with text mining. We categorize the different attributes and extend the m...