DASFAA
2004
IEEE

Semi-supervised Text Classification Using Partitioned EM

Download www.cs.uic.edu

Text classification using a small labeled set and a large unlabeled set is seen as a promising way to reduce the labor-intensive and time-consuming effort of labeling training data for building accurate classifiers, since unlabeled data is easy to obtain from the Web. It was demonstrated in [16] that an unlabeled set significantly improves classification accuracy when only a small labeled training set is available. However, the Bayesian method used in [16] assumes that text documents are generated from a mixture model and that there is a one-to-one correspondence between the mixture components and the classes. This assumption does not hold in many real-life applications, where a class may cover documents from many different topics. In such cases, the resulting classifiers can be quite poor. In this paper, we propose a clustering-based partitioning technique to solve the problem. This method first partitions the training docu...
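
To make the one-to-one assumption concrete, here is a minimal sketch of semi-supervised EM with a multinomial naive Bayes model, in which each class is modeled by exactly one mixture component. It illustrates only the baseline setting the abstract criticizes, not the partitioning method proposed in the paper (the abstract is truncated before describing it). The function names, the Laplace smoothing constant `alpha`, and the fixed iteration count are assumptions made for illustration.

```python
import numpy as np

def train_nb(X, R, alpha=1.0):
    """M-step: fit multinomial naive Bayes from soft class assignments.

    X: (n_docs, n_words) word-count matrix.
    R: (n_docs, n_classes) class responsibilities; each row sums to 1.
    alpha: Laplace smoothing constant (an assumption of this sketch).
    """
    class_weight = R.sum(axis=0)
    log_prior = np.log((class_weight + alpha)
                       / (class_weight.sum() + alpha * R.shape[1]))
    word_counts = R.T @ X + alpha            # (n_classes, n_words)
    log_word = np.log(word_counts / word_counts.sum(axis=1, keepdims=True))
    return log_prior, log_word

def posteriors(X, log_prior, log_word):
    """E-step: P(class | document) under the current parameters."""
    log_joint = X @ log_word.T + log_prior
    log_joint -= log_joint.max(axis=1, keepdims=True)  # numerical stability
    P = np.exp(log_joint)
    return P / P.sum(axis=1, keepdims=True)

def semi_supervised_em(X_lab, y_lab, X_unl, n_classes, n_iter=10):
    """One mixture component per class; labeled documents keep hard labels."""
    R_lab = np.eye(n_classes)[y_lab]
    log_prior, log_word = train_nb(X_lab, R_lab)       # initialize from labels
    X_all = np.vstack([X_lab, X_unl])
    for _ in range(n_iter):
        R_unl = posteriors(X_unl, log_prior, log_word)            # E-step
        log_prior, log_word = train_nb(X_all, np.vstack([R_lab, R_unl]))  # M-step
    return log_prior, log_word

# Toy usage: words 0-1 are typical of class 0, words 2-3 of class 1.
X_lab = np.array([[3, 1, 0, 0], [0, 0, 2, 2]])
y_lab = np.array([0, 1])
X_unl = np.array([[2, 2, 0, 0], [0, 1, 3, 2], [4, 0, 0, 1]])
log_prior, log_word = semi_supervised_em(X_lab, y_lab, X_unl, n_classes=2)
pred = posteriors(np.array([[5, 0, 0, 0], [0, 0, 0, 5]]),
                  log_prior, log_word).argmax(axis=1)
```

Because each class has a single word distribution here, a class whose documents span several unrelated topics is forced into one averaged distribution, which is the situation the abstract identifies as the source of poor classifiers.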

Gao Cong, Wee Sun Lee, Haoran Wu, Bing Liu

DASFAA 2004 | Database | Large Unlabeled Data | Small Labeled Set | Unlabeled Data |

Related Content

» A Semi-Supervised Document Clustering Algorithm Based on EM

» Cross Language Text Classification by Model Translation and Semi-Supervised Learning

» Multilabel ASRS Dataset Classification Using Semi-Supervised Subspace Clustering

» SISC: A Text Classification Approach Using Semi-Supervised Subspace Clustering

» Semi-Supervised Text Classification Using Positive and Unlabeled Data

» On Semi-Supervised Classification

» Semi-Supervised Learning for Semantic Relation Classification using Stratified Sampling Str...

» Enhancing the Performance of Semi-Supervised Classification Algorithms with Bridging

» Semi-Supervised Learning Using Gaussian Fields and Harmonic Functions

» Asymptotic Analysis of Generative Semi-Supervised Learning

Post Info

Added	20 Aug 2010
Updated	20 Aug 2010
Type	Conference
Year	2004
Where	DASFAA
Authors	Gao Cong, Wee Sun Lee, Haoran Wu, Bing Liu