Sciweavers

60 search results - page 5 / 12
» Text Classification from Labeled and Unlabeled Documents usi...
Sort
View
NIPS
2008
13 years 9 months ago
Semi-supervised Learning with Weakly-Related Unlabeled Data: Towards Better Text Categorization
The cluster assumption is exploited by most semi-supervised learning (SSL) methods. However, if the unlabeled data is merely weakly related to the target classes, it becomes quest...
Liu Yang, Rong Jin, Rahul Sukthankar
ICML
2004
IEEE
14 years 8 months ago
Co-EM support vector learning
Multi-view algorithms, such as co-training and co-EM, utilize unlabeled data when the available attributes can be split into independent and compatible subsets. Co-EM outperforms ...
Ulf Brefeld, Tobias Scheffer
ECIR
2008
Springer
13 years 9 months ago
Semi-supervised Document Classification with a Mislabeling Error Model
Abstract. This paper investigates a new extension of the Probabilistic Latent Semantic Analysis (PLSA) model [6] for text classification where the training set is partially labeled...
Anastasia Krithara, Massih-Reza Amini, Jean-Michel...
KDD
2009
ACM
156views Data Mining» more  KDD 2009»
14 years 8 months ago
Effective multi-label active learning for text classification
Labeling text data is quite time-consuming but essential for automatic text classification. Especially, manually creating multiple labels for each document may become impractical ...
Bishan Yang, Jian-Tao Sun, Tengjiao Wang, Zheng Ch...
DAWAK
2008
Springer
13 years 9 months ago
Document-Base Extraction for Single-Label Text Classification
Many text mining applications, especially when investigating Text Classification (TC), require experiments to be performed using common textcollections, such that results can be co...
Yanbo J. Wang, Robert Sanderson, Frans Coenen, Pau...