Sciweavers

85 search results - page 5 / 17
» Improving Text Classification by Web Corpora
Sort
View
AUSDM
2006
Springer
144views Data Mining» more  AUSDM 2006»
13 years 11 months ago
A Characterization of Wordnet Features in Boolean Models For Text Classification
Supervised text classification is the task of automatically assigning a category label to a previously unlabeled text document. We start with a collection of pre-labeled examples ...
Trevor N. Mansuy, Robert J. Hilderman
SAC
2004
ACM
14 years 27 days ago
Classifying biological articles using web resources
Text classification systems on biomedical literature aim to select relevant articles to a specific issue from large corpora. Most systems with an acceptable accuracy are based o...
Francisco M. Couto, Bruno Martins, Mário J....
ICDM
2008
IEEE
164views Data Mining» more  ICDM 2008»
14 years 1 months ago
Classifying High-Dimensional Text and Web Data Using Very Short Patterns
In this paper, we propose the "Democratic Classifier", a simple, democracy-inspired patternbased classification algorithm that uses very short patterns for classificatio...
Hassan H. Malik, John R. Kender
SIGIR
2008
ACM
13 years 7 months ago
Classifiers without borders: incorporating fielded text from neighboring web pages
Accurate web page classification often depends crucially on information gained from neighboring pages in the local web graph. Prior work has exploited the class labels of nearby p...
Xiaoguang Qi, Brian D. Davison
LREC
2010
144views Education» more  LREC 2010»
13 years 9 months ago
Towards an Improved Methodology for Automated Readability Prediction
Since the first half of the 20th century, readability formulas have been widely employed to automatically predict the readability of an unseen text. In this article, the formulas ...
Philip van Oosten, Dries Tanghe, Véronique ...