Sciweavers

111 search results - page 11 / 23
» Improved k-NN Algorithm for Text Classification
Sort
View
IFIP12
2004
13 years 8 months ago
Impact on Performance of Hypertext Classification of Selective Rich HTML Capture
: Hypertext categorization is the automatic classification of web documents into predefined classes. It poses new challenges for automatic categorization because of the rich inform...
Houda Benbrahim, Max Bramer
PRIS
2010
13 years 5 months ago
The Impact of Pre-processing on the Classification of MEDLINE Documents
The amount of information available in the MEDLINE database makes it very hard for a researcher to retrieve a reasonable amount of relevant documents using a simple query language ...
Carlos Adriano Gonçalves, Célia Talm...
DIS
2006
Springer
13 years 11 months ago
Model-Based Estimation of Word Saliency in Text
Abstract. We investigate a generative latent variable model for modelbased word saliency estimation for text modelling and classification. The estimation algorithm derived is able ...
Xin Wang, Ata Kabán
AAAI
1998
13 years 8 months ago
Learning to Classify Text from Labeled and Unlabeled Documents
In many important text classification problems, acquiring class labels for training documents is costly, while gathering large quantities of unlabeled data is cheap. This paper sh...
Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom...
SIGIR
2005
ACM
14 years 1 months ago
On the collective classification of email "speech acts"
We consider classification of email messages as to whether or not they contain certain “email acts”, such as a request or a commitment. We show that exploiting the sequential ...
Vitor Rocha de Carvalho, William W. Cohen