Sciweavers

ICWSM
2008

An Automatic Classification of Book Texts to User-Defined Tags

14 years 1 months ago
An Automatic Classification of Book Texts to User-Defined Tags
We describe work on automatically assigning labels to books using user-defined tags as the label set. Using supervised learning and exploring both binary and multiclass classification, we train and test classifiers on several sets of features, focusing on the size of the sets, part-of-speech classes and named entities. Results indicate that a binary classifier, trained and tested on a feature space that consists of a limited selection of parts of speech as well as all frequent named entities, achieves a classification precision of 81%, significantly outperforming a baseline which assigns the top-10 most popular tags to each book.
Sharon Givon, Theresa Wilson
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2008
Where ICWSM
Authors Sharon Givon, Theresa Wilson
Comments (0)