Structural Metadata Annotation of Speech Corpora: Comparing Broadcast News and Broadcast Conversations

15 years 8 months ago

Download www.mde.zcu.cz

Structural metadata extraction (MDE) research aims to develop techniques for automatic conversion of raw speech recognition output to forms that are more useful to humans and to downstream automatic processes. It may be achieved by inserting boundaries of syntactic/semantic units to the flow of speech, labeling non-content words like filled pauses and discourse markers for optional removal, and identifying sections of disfluent speech. This paper compares two Czech MDE speech corpora

Jáchym Kolár, Jan Svec

Real-time Traffic

Education | LREC 2008 | MDE Speech Corpora | Speech Recognition Output | Syntactic/semantic Units |

claim paper

» The Czech Broadcast Conversation Corpus

» Many uses many annotations for large speech corpora Switchboard and TDT as case studies

» Multimedia with a speech track searching spontaneous conversational speech

» To Memorize or to Predict Prominence labeling in Conversational Speech

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	LREC
Authors	Jáchym Kolár, Jan Svec

Comments (0)

Sciweavers

Structural Metadata Annotation of Speech Corpora: Comparing Broadcast News and Broadcast Conversations

Education | LREC 2008 | MDE Speech Corpora | Speech Recognition Output | Syntactic/semantic Units |

Explore & Download

Productivity Tools

Sciweavers