Sciweavers

ACL
2006
14 years 29 days ago
Exploiting Comparable Corpora and Bilingual Dictionaries for Cross-Language Text Categorization
Cross-language Text Categorization is the task of assigning semantic classes to documents written in a target language (e.g. English) while the system is trained using labeled doc...
Alfio Massimiliano Gliozzo, Carlo Strapparava
LREC
2008
82views Education» more  LREC 2008»
14 years 1 months ago
An eRulemaking Corpus: Identifying Substantive Issues in Public Comments
We describe the creation of a corpus that supports a real-world hierarchical text categorization task in the domain of electronic rulemaking (eRulemaking). Features of the task an...
Claire Cardie, Cynthia Farina, Matt Rawding, Adil ...
DGO
2008
113views Education» more  DGO 2008»
14 years 1 months ago
A study in rule-specific issue categorization for e-rulemaking
We address the e-rulemaking problem of categorizing public comments according to the issues that they address. In contrast to previous text categorization research in e-rulemaking...
Claire Cardie, Cynthia Farina, Adil Aijaz, Matt Ra...
ECAI
2008
Springer
14 years 1 months ago
Author Identification Using a Tensor Space Representation
Author identification is a text categorization task with applications in intelligence, criminal law, computer forensics, etc. Usually, in such cases there is shortage of training t...
Spyridon Plakias, Efstathios Stamatatos
IJCNN
2000
IEEE
14 years 4 months ago
Support Vector Machines Based on a Semantic Kernel for Text Categorization
We propose to solve a text categorization task using a new metric between documents, based on a priori semantic knowledge about words. This metric can be incorporated into the def...
George Siolas, Florence d'Alché-Buc