Sciweavers

KI
2008
Springer

Multi-value Classification of Very Short Texts

14 years 12 days ago
Multi-value Classification of Very Short Texts
We introduce a new stacking-like approach for multi-value classification. We apply this classification scheme using Naive Bayes, Rocchio and kNN classifiers on the well-known Reuters dataset. We use part-of-speech tagging for stopword removal. We show that our setup performs almost as well as other approaches that use the full article text even though we only classify headlines. Finally, we apply a Rocchio classifier on a dataset from a Web 2.0 site and show that it is suitable for semi-automated labelling (often called tagging) of short texts and is faster than other approaches.
Andreas Heß, Philipp Dopichaj, Christian Maa
Added 13 Dec 2010
Updated 13 Dec 2010
Type Journal
Year 2008
Where KI
Authors Andreas Heß, Philipp Dopichaj, Christian Maaß
Comments (0)