Text Categorization for Multi-label Documents and Many Categories

16 years 1 months ago

Download www.prism.uvsq.fr

In this paper, we propose a new classification method that addresses classification in multiple categories of textual documents. We call it Matrix Regression (MR) due to its resemblance to regression in a high dimensional space. Experiences on a medical corpus of hospital records to be classified by ICD (International Classification of Diseases) code demonstrate the validity of the MR approach. We compared MR with three frequently used algorithms in text categorization that are k-Nearest Neighbors, Centroide and Support Vector Machine. The experimental results show that our method outperforms them in both precision and time of classification.

Iulian Sandu Popa, Karine Zeitouni, Georges Gardar

Real-time Traffic

CBMS 2007 | International Classification | Medical Imaging | MR Approach | Support Vector Machine |

claim paper

» Exploiting Category Information and Document Information to Improve Term Weighting for Tex...

» Text categorization with many redundant features using aggressive feature selection to mak...

» Largescale text categorization by batch mode active learning

» Multilabel Text Categorization Using VGRAM Weightless Neural Networks

» An Empirical Study of Category Skew on Feature Selection for Text Categorization

» An EM Based Training Algorithm for CrossLanguage Text Categorization

» Summarization as Feature Selection for Document Categorization on Small Datasets

» Improving VGRAM WNN Multilabel Text Categorization via Label Correlation

Post Info
More Details (n/a)

Added	02 Jun 2010
Updated	02 Jun 2010
Type	Conference
Year	2007
Where	CBMS
Authors	Iulian Sandu Popa, Karine Zeitouni, Georges Gardarin, Didier Nakache, Elisabeth Métais

Comments (0)

Sciweavers

Text Categorization for Multi-label Documents and Many Categories

CBMS 2007 | International Classification | Medical Imaging | MR Approach | Support Vector Machine |

Explore & Download

Productivity Tools

Sciweavers