Detecting Structural Metadata with Decision Trees and Transformation-Based Learning

14 years 3 months ago

Download acl.ldc.upenn.edu

The regular occurrence of disfluencies is a distinguishing characteristic of spontaneous speech. Detecting and removing such disfluencies can substantially improve the usefulness of spontaneous speech transcripts. This paper presents a system that detects various types of disfluencies and other structural information with cues obtained from lexical and prosodic information sources. Specifically, combinations of decision trees and language models are used to predict sentence ends and interruption points and, given these events, transformationbased learning is used to detect edit disfluencies and conversational fillers. Results are reported on human and automatic transcripts of conversational telephone speech.

Joungbum Kim, Sarah E. Schwarm, Mari Ostendorf

Real-time Traffic

Conversational Telephone Speech | NAACL 2004 | NAACL 2007 | Spontaneous Speech | Spontaneous Speech Transcripts |

claim paper

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2004
Where	NAACL
Authors	Joungbum Kim, Sarah E. Schwarm, Mari Ostendorf

Comments (0)

Sciweavers

Detecting Structural Metadata with Decision Trees and Transformation-Based Learning

Conversational Telephone Speech | NAACL 2004 | NAACL 2007 | Spontaneous Speech | Spontaneous Speech Transcripts |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers