Sciweavers

68 search results - page 8 / 14
» Bayesian online classifiers for text classification and filt...
AUSAI
2008
Springer
13 years 10 months ago
Cross-Domain Knowledge Transfer Using Semi-supervised Classification
Traditional text classification algorithms are based on a basic assumption: the training and test data should hold the same distribution. However, this identical distribution assum...
Yi Zhen, Chunping Li
JMLR
2006
13 years 8 months ago
Spam Filtering Using Statistical Data Compression Models
Spam filtering poses a special problem in text categorization, of which the defining characteristic is that filters face an active adversary, which constantly attempts to evade fi...
Andrej Bratko, Gordon V. Cormack, Bogdan Filipic, ...
ADCS
2004
13 years 10 months ago
Co-Training on Textual Documents with a Single Natural Feature Set
Co-training is a semi-supervised technique that allows classifiers to learn with fewer labelled documents by taking advantage of the more abundant unclassified documents. However, ...
Jason Chan, Irena Koprinska, Josiah Poon
NIPS
2001
13 years 10 months ago
Latent Dirichlet Allocation
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of discrete data such as text corpora. LDA is a three-level hierarchical Bayesian m...
David M. Blei, Andrew Y. Ng, Michael I. Jordan
DL
1998
Springer
14 years 24 days ago
SONIA: A Service for Organizing Networked Information Autonomously
The recent explosion of on-line information in Digital Libraries and on the World Wide Web has given rise to a number of query-based search engines and manually constructed topica...
Mehran Sahami, Salim Yusufali, Michelle Q. Wang Ba...