ECAI
2006
Springer
14 years 3 months ago
Text Sampling and Re-Sampling for Imbalanced Authorship Identification Cases
Authorship identification can be seen as a single-label multi-class text categorization problem. Very often, there are extremely few training texts at least for some of the candida...
Efstathios Stamatatos
ISNN
2007
Springer
14 years 5 months ago
A Probabilistic Approach to Feature Selection for Multi-class Text Categorization
In this paper, we propose a probabilistic approach to feature selection for multi-class text categorization. Specifically, we regard document class and occurrence of eac...
Ke Wu, Bao-Liang Lu, Masao Uchiyama, Hitoshi Isaha...