Sciweavers

ICDAR
2009
IEEE

Using Kernel Density Classifier with Topic Model and Cost Sensitive Learning for Automatic Text Categorization

13 years 9 months ago
Using Kernel Density Classifier with Topic Model and Cost Sensitive Learning for Automatic Text Categorization
This paper proposes a novel framework for automatic text categorization problem based on the kernel density classifier. The overall goal is to tackle two main issues in automatic text categorization problems: the interpretability and the performance. Specifically, to solve the interpretability issue, the Latent Semantic Analysis technique is used to construct a topic space, in which each dimension represents a single topic. The text features are extracted directly from this topic space. To solve the performance issue, classifiers' parameters are optimized for either costsensitive or non-cost-sensitive categorization. We have experimentally evaluated the proposed framework by using a corpus of twenty newsgroups. The experimental results confirm the effectiveness of the framework to utilize the features from the topic model for cost-sensitive categorization.
Dwi Sianto Mansjur, Ted S. Wada, Biing-Hwang Juang
Added 18 Feb 2011
Updated 18 Feb 2011
Type Journal
Year 2009
Where ICDAR
Authors Dwi Sianto Mansjur, Ted S. Wada, Biing-Hwang Juang
Comments (0)