Prosody-Based Automatic Segmentation of Speech into Sentences and Topics

15 years 6 months ago

Download www.speech.sri.com

A crucial step in processing speech audio data for information extraction, topic detection, or browsing/playback is to segment the input into sentence and topic units. Speech segmentation is challenging, since the cues typically present for segmenting text (headers, paragraphs, punctuation) are absent in spoken language. We investigate the use of prosody (information gleaned from the timing and melody of speech) for these tasks. Using decision tree and hidden Markov modeling techniques, we combine prosodic cues with word-based approaches, and evaluate performance on two speech corpora, Broadcast News and Switchboard. Results show that the prosodic model alone performs on par with, or better than, word-based statistical language models

Elizabeth Shriberg, Andreas Stolcke, Dilek Z. Hakk

Real-time Traffic

CORR 2000 | Education | Prosodic | Prosodic Cues | Prosodic Models |

claim paper

» Genre effects on automatic sentence segmentation of speech A comparison of broadcast news ...

» Impact of automatic sentence segmentation on meeting summarization

» Evaluation of semantic role labeling and dependency parsing of automatic speech recognitio...

» Modeling Topic Coherence for Speech Recognition

» CrossGenre Feature Comparisons for Spoken Sentence Segmentation

» Gestural Cohesion for Topic Segmentation

» Unsupervised Topic Adaptation for Lecture Speech Retrieval

» On the Use of Web Resources and Natural Language Processing Techniques to Improve Automati...

Post Info
More Details (n/a)

Added	17 Dec 2010
Updated	17 Dec 2010
Type	Journal
Year	2000
Where	CORR
Authors	Elizabeth Shriberg, Andreas Stolcke, Dilek Z. Hakkani-Tür, Gökhan Tür

Comments (0)

Sciweavers

Prosody-Based Automatic Segmentation of Speech into Sentences and Topics

CORR 2000 | Education | Prosodic | Prosodic Cues | Prosodic Models |

Explore & Download

Productivity Tools

Sciweavers