Part-of-speech histograms for genre classification of text

This work addresses the problem of classifying the genre of text, which is useful for a variety of language processing problems. We propose statistics of POS histograms as classification features, coupled with a quadratic discriminant classifier. In experiments on six different text and speech genres, we demonstrate enhanced performance compared to standard techniques using word frequency count features and POS trigram features. Experiments on genres that were not seen in training show intuitive overlaps with the training classes.
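The approach described in the abstract can be illustrated with a rough sketch. The abstract does not say which histogram statistics are used, how windows are formed, or how the classifier is regularized, so everything below is an assumption made for illustration: the five-tag toy tagset, the fixed non-overlapping windows, the choice of per-tag mean and standard deviation, the variance floor, and the hypothetical genre labels "news" and "chat". The classifier is a simplified quadratic discriminant with diagonal covariance per class, not necessarily the classifier used in the paper, and the input is assumed to be already POS-tagged.

```python
import math
from collections import Counter

# Hypothetical toy tagset; a real tagger would use a much larger one.
TAGS = ["NOUN", "VERB", "ADJ", "PRON", "OTHER"]


def pos_histogram(tags):
    """Normalized tag-frequency histogram for one window of POS tags."""
    if not tags:
        raise ValueError("window must contain at least one tag")
    # Tags outside the toy tagset are folded into OTHER.
    counts = Counter(t if t in TAGS else "OTHER" for t in tags)
    return [counts[t] / len(tags) for t in TAGS]


def histogram_stats(tags, window=5):
    """Per-tag mean and standard deviation of the window histograms."""
    starts = range(0, len(tags) - window + 1, window)
    windows = [tags[i:i + window] for i in starts] or [tags]
    hists = [pos_histogram(w) for w in windows]
    feats = []
    for column in zip(*hists):
        mean = sum(column) / len(column)
        var = sum((v - mean) ** 2 for v in column) / len(column)
        feats += [mean, math.sqrt(var)]
    return feats


class DiagonalQDA:
    """Gaussian classifier with a separate diagonal covariance per class."""

    def __init__(self, var_floor=1e-3):
        self.var_floor = var_floor  # assumed regularizer, avoids zero variance
        self.params = {}

    def fit(self, X, y):
        for label in set(y):
            rows = [x for x, lab in zip(X, y) if lab == label]
            n = len(rows)
            means = [sum(col) / n for col in zip(*rows)]
            variances = [
                max(sum((v - m) ** 2 for v in col) / n, self.var_floor)
                for col, m in zip(zip(*rows), means)
            ]
            self.params[label] = (math.log(n / len(y)), means, variances)
        return self

    def score(self, x, label):
        """Log prior plus Gaussian log-likelihood; quadratic in x."""
        log_prior, means, variances = self.params[label]
        return log_prior - 0.5 * sum(
            math.log(2 * math.pi * s) + (v - m) ** 2 / s
            for v, m, s in zip(x, means, variances)
        )

    def predict(self, x):
        return max(self.params, key=lambda label: self.score(x, label))


# Toy, hand-made tag sequences standing in for tagged documents.
train_docs = [
    (["NOUN", "ADJ", "NOUN", "VERB", "NOUN"] * 4, "news"),
    (["ADJ", "NOUN", "NOUN", "VERB", "NOUN",
      "NOUN", "OTHER", "NOUN", "ADJ", "NOUN"] * 2, "news"),
    (["PRON", "VERB", "PRON", "VERB", "OTHER"] * 4, "chat"),
    (["PRON", "VERB", "OTHER", "PRON", "VERB",
      "VERB", "PRON", "OTHER", "PRON", "VERB"] * 2, "chat"),
]
X = [histogram_stats(tags) for tags, _ in train_docs]
y = [label for _, label in train_docs]
model = DiagonalQDA().fit(X, y)

news_like = histogram_stats(["NOUN", "NOUN", "ADJ", "NOUN", "VERB"] * 3)
chat_like = histogram_stats(["PRON", "VERB", "VERB", "PRON", "OTHER"] * 3)
print(model.predict(news_like), model.predict(chat_like))
```

Each document is reduced to a fixed-length vector (two numbers per tag) regardless of its length, which is what lets a Gaussian classifier be fit directly. A full-covariance quadratic discriminant would also model correlations between tags, at the cost of needing more training documents per genre.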

ICASSP 2009 | POS Histograms | POS Trigram Features | Quadratic Discriminant Classifier | Signal Processing

Added	21 May 2010
Updated	21 May 2010
Type	Conference
Year	2009
Where	ICASSP
Authors	Sergey Feldman, Marius A. Marin, Mari Ostendorf, Maya R. Gupta
