Lessons Learned in Part-of-Speech Tagging of Conversational Speech

13 years 10 months ago

Download www.cs.umd.edu

This paper examines tagging models for spontaneous English speech transcripts. We analyze the performance of state-of-the-art tagging models, either generative or discriminative, left-to-right or bidirectional, with or without latent annotations, together with the use of ToBI break indexes and several methods for segmenting the speech transcripts (i.e., conversation side, speaker turn, or humanannotated sentence). Based on these studies, we observe that: (1) bidirectional models tend to achieve better accuracy levels than left-toright models, (2) generative models seem to perform somewhat better than discriminative models on this task, and (3) prosody improves tagging performance of models on conversation sides, but has much less impact on smaller segments. We conclude that, although the use of break indexes can indeed significantly improve performance over baseline models without them on conversation sides, tagging accuracy improves more by using smaller segments, for which the impac...

Vladimir Eidelman, Zhongqiang Huang, Mary P. Harpe

Real-time Traffic

Break Indexes | Conversation Sides | EMNLP 2010 | Natural Language Processing | Smaller Segments |

claim paper

Post Info
More Details (n/a)

Added	11 Feb 2011
Updated	11 Feb 2011
Type	Journal
Year	2010
Where	EMNLP
Authors	Vladimir Eidelman, Zhongqiang Huang, Mary P. Harper

Comments (0)

Sciweavers

Lessons Learned in Part-of-Speech Tagging of Conversational Speech

Break Indexes | Conversation Sides | EMNLP 2010 | Natural Language Processing | Smaller Segments |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers