With the rapid emergence and proliferation of Internet and the trend of globalization, a tremendous amount of textual documents written in different languages are electronically ac...
Abstract. In this paper, we propose a probabilistic approach to feature selection for multi-class text categorization. Specifically, we regard document class and occurrence of eac...
Ke Wu, Bao-Liang Lu, Masao Uchiyama, Hitoshi Isaha...
This paper proposes a new approach for text categorization, based on a feature projection technique. In our approach, training data are represented as the projections of training ...
This paper presents an approach to text categorization that i) uses no machine learning and ii) reacts on-the-fly to unknown words. These features are important for categorizing B...
When text categorization is applied to complex tasks, it is tedious and expensive to hand-label the large amounts of training data necessary for good performance. In this paper, we...