Email classification with co-training

The main problems in text classification are the lack of labeled data and the cost of labeling unlabeled data. We address these problems by exploring co-training, an algorithm that uses unlabeled data along with a few labeled examples to boost the performance of a classifier. We experiment with co-training on the email domain. Our results show that the performance of co-training depends on the learning algorithm it uses. In particular, Support Vector Machines significantly outperform Naive Bayes on email classification.
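As a rough illustration of the co-training loop the abstract refers to, the sketch below trains one classifier per "view" of each email and lets each classifier label the unlabeled examples it is most confident about, which then join the labeled pool. Everything concrete here is an assumption for illustration and not the authors' setup: the two views (subject words and body words), the toy emails, the names TinyNB and co_train, and the rounds and per_round parameters. A small Naive Bayes model is used only to keep the example dependency-free; the abstract reports that Support Vector Machines performed better on email.

```python
import math
from collections import Counter

class TinyNB:
    """Multinomial Naive Bayes over word lists, with add-one smoothing."""
    def fit(self, docs, labels):
        self.classes = sorted(set(labels))
        self.prior = Counter(labels)
        self.counts = {c: Counter() for c in self.classes}
        for words, y in zip(docs, labels):
            self.counts[y].update(words)
        self.vocab = {w for c in self.classes for w in self.counts[c]}
        return self

    def predict_proba(self, words):
        n = sum(self.prior.values())
        v = len(self.vocab) + 1  # one extra slot for unseen words
        logp = {}
        for c in self.classes:
            total = sum(self.counts[c].values())
            lp = math.log(self.prior[c] / n)
            for w in words:
                lp += math.log((self.counts[c][w] + 1) / (total + v))
            logp[c] = lp
        m = max(logp.values())
        z = sum(math.exp(l - m) for l in logp.values())
        return {c: math.exp(l - m) / z for c, l in logp.items()}

def co_train(labeled, unlabeled, rounds=5, per_round=1):
    """labeled: [((view_a, view_b), label)]; unlabeled: [(view_a, view_b)]."""
    labeled, unlabeled = list(labeled), list(unlabeled)  # work on copies
    for _ in range(rounds):
        if not unlabeled:
            break
        ys = [y for _, y in labeled]
        # One classifier per view, each trained on the current labeled pool.
        clfs = [TinyNB().fit([x[v] for x, _ in labeled], ys) for v in (0, 1)]
        picked = {}
        for v, clf in enumerate(clfs):
            scored = []
            for i, x in enumerate(unlabeled):
                proba = clf.predict_proba(x[v])
                label = max(proba, key=proba.get)
                scored.append((proba[label], i, label))
            # Each view contributes its most confident predictions.
            for _conf, i, label in sorted(scored, reverse=True)[:per_round]:
                picked.setdefault(i, label)
        labeled += [(unlabeled[i], y) for i, y in picked.items()]
        unlabeled = [x for i, x in enumerate(unlabeled) if i not in picked]
    ys = [y for _, y in labeled]
    final = [TinyNB().fit([x[v] for x, _ in labeled], ys) for v in (0, 1)]
    return final, labeled

# Hypothetical toy data: each email is (subject words, body words).
labeled = [
    ((["meeting", "agenda"], ["please", "review", "the", "project", "schedule"]), "work"),
    ((["cheap", "offer"], ["buy", "pills", "now", "discount"]), "spam"),
]
unlabeled = [
    (["project", "meeting"], ["schedule", "attached", "please", "review"]),
    (["discount", "offer"], ["buy", "now", "cheap", "pills"]),
    (["agenda", "update"], ["project", "notes"]),
    (["cheap", "deal"], ["discount", "pills"]),
]
classifiers, grown = co_train(labeled, unlabeled, rounds=3)
print(len(labeled), "labeled examples grew to", len(grown))
```

Swapping TinyNB for another learner with the same fit and predict_proba methods is one way to compare base algorithms inside the same loop, which is the kind of comparison the abstract describes.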

Svetlana Kiritchenko, Stan Matwin

CASCON 2001 | CASCON 2007 | Co-training | Text Classification | Unlabeled Data |

Post Info

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2001
Where	CASCON
Authors	Svetlana Kiritchenko, Stan Matwin
