We study dimensionality reduction or feature selection in the text document categorization problem. We focus on the first step in building text categorization systems, that is, the cho...
Abstract. In this paper, we propose a probabilistic approach to feature selection for multi-class text categorization. Specifically, we regard document class and occurrence of eac...
Ke Wu, Bao-Liang Lu, Masao Uchiyama, Hitoshi Isaha...
This paper presents a method for incorporating natural language processing into existing text categorization procedures. Three aspects are considered in the investigation: (i) a m...
In this paper we explore the potential of concept indexing with WordNet synsets for Text Categorization, in comparison with the traditional bag of words text representation model. ...
Research work related to applying text categorization methods to a monolingual corpus such as English text collections has been well established by several research teams in rece...