Sciweavers

85 search results - page 4 / 17
» Improving Text Classification by Web Corpora
KDD
2004
ACM
14 years 7 months ago
Boosting for Text Classification with Semantic Features
Abstract. Current text classification systems typically use term stems for representing document content. Semantic Web technologies allow the usage of features on a higher semantic...
Stephan Bloehdorn, Andreas Hotho
AIME
2003
Springer
14 years 21 days ago
Learning Derived Words from Medical Corpora
Abstract. Morphological knowledge (inflection, derivation, compounds) is useful for medical language processing. Some is available for medical English in the UMLS Specialist Lexic...
Pierre Zweigenbaum, Natalia Grabar
IJDLS
2010
13 years 4 months ago
Sampling the Web as Training Data for Text Classification
Data acquisition is a major concern in text classification. The excessive human effort required by conventional methods to build up a quality training collection might not always b...
Wei-Yen Day, Chun-Yi Chi, Ruey-Cheng Chen, Pu-Jen ...
LREC
2008
13 years 9 months ago
A Lightweight and Efficient Tool for Cleaning Web Pages
Originally conceived as a "naive" baseline experiment using traditional n-gram language models as classifiers, the NCLEANER system has turned out to be a fast and lightw...
Stefan Evert
ICCS
2007
Springer
14 years 1 month ago
Text Classification with Support Vector Machine and Back Propagation Neural Network
Abstract. We compared a support vector machine (SVM) with a back propagation neural network (BPNN) for the task of text classification of XiangShan science conference (XSSC) web do...
Wen Zhang, Xijin Tang, Taketoshi Yoshida