Sciweavers

220 search results - page 8 / 44
» Language Independent Text Categorization
Sort
View
ECML
2006
Springer
13 years 9 months ago
Distributional Features for Text Categorization
Abstract-- Text categorization is the task of assigning predefined categories to natural language text. With the widely used `bag of words' representation, previous researches...
Xiao-Bing Xue, Zhi-Hua Zhou
NLDB
2005
Springer
14 years 1 months ago
The Role of Word Sense Disambiguation in Automated Text Categorization
Abstract. Automated Text Categorization has reached the levels of accuracy of human experts. Provided that enough training data is available, it is possible to learn accurate autom...
José María Gómez Hidalgo, Man...
NLDB
2004
Springer
14 years 1 months ago
Concept Indexing for Automated Text Categorization
In this paper we explore the potential of concept indexing with WordNet synsets for Text Categorization, in comparison with the traditional bag of words text representation model. ...
José María Gómez Hidalgo, Jos...
ICDAR
2007
IEEE
13 years 11 months ago
Identification of Latin-Based Languages through Character Stroke Categorization
This paper presents a language identification technique that detects Latin-based languages of imaged documents without OCR. The proposed technique detects languages through the wo...
S. J. Lu, L. Li, Chew Lim Tan
SIGIR
2010
ACM
13 years 11 months ago
Combining coregularization and consensus-based self-training for multilingual text categorization
We investigate the problem of learning document classifiers in a multilingual setting, from collections where labels are only partially available. We address this problem in the ...
Massih-Reza Amini, Cyril Goutte, Nicolas Usunier