ICASSP
2009
IEEE

Part-of-speech histograms for genre classification of text

This work addresses the problem of classifying the genre of text, which is useful for a variety of language processing problems. We propose statistics of POS histograms as classification features, coupled with a quadratic discriminant classifier. In experiments on six different text and speech genres, we demonstrate enhanced performance compared to standard techniques using word frequency count features and POS trigram features. Experiments on genres that were not seen in training show intuitive overlaps with the training classes.
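One way to read "statistics of POS histograms" is sketched below: split a tagged document into windows, build a normalized tag histogram for each window, and summarize the histograms by their per-tag mean and standard deviation. The abstract does not spell out the exact statistics, window size, or tagset, so the function names, the non-overlapping windows, the mean and standard deviation choice, and the toy tagset here are all assumptions for illustration, not the authors' implementation. The input is assumed to be already POS-tagged; the quadratic discriminant classifier that would consume these features is omitted.

```python
from collections import Counter
from statistics import mean, pstdev


def pos_histogram(tags, tagset):
    """Relative frequency of each tag in `tagset` within one window of tags."""
    if not tags:
        raise ValueError("window must contain at least one tag")
    counts = Counter(tags)
    return [counts[t] / len(tags) for t in tagset]


def histogram_features(tags, tagset, window=5):
    """Per-tag mean and standard deviation over per-window POS histograms.

    Hypothetical feature definition: non-overlapping windows of `window`
    tags; a trailing partial window is dropped. If the document is shorter
    than one window, the whole document is used as a single window.
    """
    windows = [tags[i:i + window]
               for i in range(0, len(tags) - window + 1, window)]
    if not windows:
        windows = [tags]
    hists = [pos_histogram(w, tagset) for w in windows]
    columns = list(zip(*hists))  # one column of values per tag
    return [mean(c) for c in columns] + [pstdev(c) for c in columns]


# Toy example with an assumed two-tag tagset.
toy_tagset = ["NOUN", "VERB"]
toy_tags = ["NOUN", "NOUN", "VERB", "VERB"]
features = histogram_features(toy_tags, toy_tagset, window=2)
```

The resulting vector has two entries per tag (mean, then standard deviation), so its length is independent of document length, which is what makes it usable as input to a fixed-dimension classifier.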
Added 21 May 2010
Updated 21 May 2010
Type Conference
Year 2009
Where ICASSP
Authors Sergey Feldman, Marius A. Marin, Mari Ostendorf, Maya R. Gupta