A Boosting Approach to Topic Spotting on Subdialogues

We report the results of a study on topic spotting in conversational speech. Using a machine learning approach, we build classifiers that accept an audio file of conversational human speech as input, and output an estimate of the topic being discussed. Our methodology makes use of a well-known corpus of transcribed and topic-labeled speech (the Switchboard corpus), and involves an interesting double use of the BOOSTEXTER learning algorithm. Our work is distinguished from previous efforts in topic spotting by our explicit study of the effects of dialogue length on classifier performance, and by our use of off-the-shelf speech recognition technology. One of our main results is the identification of a single classifier with good performance (relative to our classifier space) across all subdialogue lengths.
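To make the idea of boosting over subdialogues concrete, here is a minimal sketch. It is not BOOSTEXTER and not the authors' pipeline: it assumes a plain two-class discrete AdaBoost over word-presence rules, skips speech recognition entirely, and uses six invented toy transcripts in place of Switchboard. All names (`stump`, `train_adaboost`, `predict`, `accuracy`, `transcripts`) are hypothetical. A subdialogue is approximated here as the first k words of a transcript.

```python
import math

def stump(word, polarity, words):
    """Weak rule: vote `polarity` if `word` occurs in `words`, else the opposite."""
    return polarity if word in words else -polarity

def train_adaboost(data, rounds=3):
    """Discrete AdaBoost over word-presence stumps; labels are +1 or -1."""
    docs = [(set(words), label) for words, label in data]
    vocab = sorted(set().union(*(d for d, _ in docs)))
    weights = [1.0 / len(docs)] * len(docs)
    model = []  # list of (word, polarity, alpha)
    for _ in range(rounds):
        # Pick the stump with the lowest weighted training error.
        err, word, pol = min(
            ((sum(w for w, (d, y) in zip(weights, docs) if stump(v, p, d) != y), v, p)
             for v in vocab for p in (1, -1)),
            key=lambda t: t[0],
        )
        err = min(max(err, 1e-10), 1 - 1e-10)  # guard against log(0)
        alpha = 0.5 * math.log((1 - err) / err)
        model.append((word, pol, alpha))
        # Up-weight the transcripts this stump got wrong, then renormalize.
        weights = [w * math.exp(-alpha * y * stump(word, pol, d))
                   for w, (d, y) in zip(weights, docs)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return model

def predict(model, words):
    """Sign of the alpha-weighted vote of all stumps."""
    present = set(words)
    score = sum(alpha * stump(word, pol, present) for word, pol, alpha in model)
    return 1 if score >= 0 else -1

def accuracy(model, data, k):
    """Fraction of transcripts labeled correctly from their first k words only."""
    hits = sum(predict(model, words[:k]) == label for words, label in data)
    return hits / len(data)

PETS, CARS = 1, -1  # invented topic labels for the toy data
transcripts = [(text.split(), label) for text, label in [
    ("well the other day my dog chased squirrels", PETS),
    ("well the funny thing is our dog hates that cat", PETS),
    ("well the kids begged until we adopted a cat", PETS),
    ("well the dealer finally sold me this car", CARS),
    ("well the mechanic said both car and truck need brakes", CARS),
    ("well the mileage seems awful with any truck", CARS),
]]

model = train_adaboost(transcripts, rounds=3)
# Evaluated on the training transcripts for brevity; a real study would hold data out.
for k in (2, 7, 100):
    print(k, accuracy(model, transcripts, k))
```

On this toy data, accuracy rises from 0.5 with two-word prefixes to 1.0 with full transcripts, because short prefixes end before any topic-bearing word appears. This illustrates only why subdialogue length can matter for a classifier of this kind, and says nothing about the results reported in the paper.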

Kary Myers, Michael J. Kearns, Satinder P. Singh, Marilyn A. Walker

Classifier Performance | Conversational Human Speech | ICML 2000 | Machine Learning | Topic Spotting |

Post Info

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2000
Where	ICML
Authors	Kary Myers, Michael J. Kearns, Satinder P. Singh, Marilyn A. Walker
