Sciweavers

315 search results - page 35 / 63
» Text classification from positive and unlabeled documents
Sort
View
WWW
2009
ACM
14 years 8 months ago
Latent space domain transfer between high dimensional overlapping distributions
Transferring knowledge from one domain to another is challenging due to a number of reasons. Since both conditional and marginal distribution of the training data and test data ar...
Sihong Xie, Wei Fan, Jing Peng, Olivier Verscheure...
ECIR
2004
Springer
13 years 9 months ago
Improving Retrieval Effectiveness by Reranking Documents Based on Controlled Vocabulary
Abstract. There is a common availability of classification terms in online text collections and digital libraries, such as manually assigned keywords or key-phrases from a controll...
Jaap Kamps
DMIN
2006
146views Data Mining» more  DMIN 2006»
13 years 9 months ago
A Comparison of Two Document Clustering Approaches for Clustering Medical Documents
Medical data is often presented as free text in the form of medical reports. Such documents contain important information about patients, disease progression and management, but ar...
Fathi H. Saad, Beatriz de la Iglesia, Duncan G. Be...
IPM
2006
146views more  IPM 2006»
13 years 7 months ago
Dictionary-based text categorization of chemical web pages
A new dictionary-based text categorization approach is proposed to classify the chemical web pages efficiently. Using a chemistry dictionary, the approach can extract chemistry-re...
Chunyan Liang, Li Guo, Zhaojie Xia, Feng-Guang Nie...
ICML
2004
IEEE
14 years 8 months ago
Co-EM support vector learning
Multi-view algorithms, such as co-training and co-EM, utilize unlabeled data when the available attributes can be split into independent and compatible subsets. Co-EM outperforms ...
Ulf Brefeld, Tobias Scheffer