Sciweavers

185 search results - page 23 / 37
» A Re-Examination of Text Categorization Methods
Sort
View
AAAI
1998
13 years 8 months ago
Feature Generation for Sequence Categorization
The problem of sequence categorization is to generalize from a corpus of labeled sequences procedures for accurately labeling future unlabeled sequences. The choice of representat...
Daniel Kudenko, Haym Hirsh
ICDAR
2007
IEEE
13 years 11 months ago
Identification of Latin-Based Languages through Character Stroke Categorization
This paper presents a language identification technique that detects Latin-based languages of imaged documents without OCR. The proposed technique detects languages through the wo...
S. J. Lu, L. Li, Chew Lim Tan
DMIN
2006
114views Data Mining» more  DMIN 2006»
13 years 8 months ago
Towards Using Fewer Features for Text Classification
Abstract-- Text classification or categorization is a conventional classification problem applied to the text domain. In the cases when statistical classification methods are used,...
Yuan Yuan, Tianyang Gu
COMSIS
2011
12 years 11 months ago
Ontology-based multi-label classification of economic articles
The paper presents an approach to the task of automatic document categorization in the field of economics. Since the documents can be annotated with multiple keywords (labels), we ...
Sergeja Vogrincic, Zoran Bosnic
SIGIR
2008
ACM
13 years 7 months ago
Exploiting subjectivity analysis in blogs to improve political leaning categorization
In this paper, we address a relatively new and interesting text categorization problem: classify a political blog as either liberal or conservative, based on its political leaning...
Maojin Jiang, Shlomo Argamon