Sciweavers

COMSIS
2011
13 years 3 months ago
Ontology-based multi-label classification of economic articles
The paper presents an approach to the task of automatic document categorization in the field of economics. Since the documents can be annotated with multiple keywords (labels), we ...
Sergeja Vogrincic, Zoran Bosnic
TAL
2010
Springer
13 years 9 months ago
Summarization as Feature Selection for Document Categorization on Small Datasets
Abstract. Most common feature selection techniques for document categorization are supervised and require lots of training data in order to accurately capture the descriptive and d...
Emmanuel Anguiano-Hernández, Luis Villase&n...
IPM
2006
104views more  IPM 2006»
13 years 11 months ago
Hierarchical document categorization with k-NN and concept-based thesauri
In this paper, we propose a new algorithm, which incorporates the relationships of concept-based thesauri into the document categorization using the k-NN classifier (k-NN). k-NN i...
Sun Lee Bang, Jae Dong Yang, Hyung Jeong Yang
IJIT
2004
14 years 25 days ago
The Usefulness of Logical Structure in Flexible Document Categorization
This paper presents a new approach for automatic document categorization. Exploiting the logical structure of the document, our approach assigns a HTML document to one or more cate...
Jebari Chaker, Habib Ounelli
ESANN
2007
14 years 26 days ago
Kernel PCA based clustering for inducing features in text categorization
We study dimensionality reduction or feature selection in text document categorization problem. We focus on the first step in building text categorization systems, that is the cho...
Zsolt Minier, Lehel Csató
TAL
2004
Springer
14 years 4 months ago
One Size Fits All? A Simple Technique to Perform Several NLP Tasks
Word fragments or n-grams have been widely used to perform different Natural Language Processing tasks such as information retrieval [1] [2], document categorization [3], automatic...
Daniel Gayo-Avello, Darío Álvarez Gu...
KDD
2008
ACM
183views Data Mining» more  KDD 2008»
14 years 12 months ago
Structured entity identification and document categorization: two tasks with one joint model
Traditionally, research in identifying structured entities in documents has proceeded independently of document categorization research. In this paper, we observe that these two t...
Indrajit Bhattacharya, Shantanu Godbole, Sachindra...