Unsupervised Topic Modelling for Multi-Party Spoken Discourse

14 years 4 months ago

Download web.mit.edu

We present a method for unsupervised topic modelling which adapts methods used in document classification (Blei et al., 2003; Griffiths and Steyvers, 2004) to unsegmented multi-party discourse transcripts. We show how Bayesian inference in this generative model can be used to simultaneously address the problems of topic segmentation and topic identification: automatically segmenting multi-party meetings into topically coherent segments with performance which compares well with previous unsupervised segmentation-only methods (Galley et al., 2003) while simultaneously extracting topics which rate highly when assessed for coherence by human judges. We also show that this method appears robust in the face of off-topic dialogue and speech recognition errors.

Matthew Purver, Konrad P. Körding, Thomas L.

Real-time Traffic

ACL 2006 | ACL 2007 | Unsegmented Multi-party Discourse | Unsupervised Segmentation-only Methods | Unsupervised Topic |

claim paper

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2006
Where	ACL
Authors	Matthew Purver, Konrad P. Körding, Thomas L. Griffiths, Joshua B. Tenenbaum

Comments (0)

Sciweavers

Unsupervised Topic Modelling for Multi-Party Spoken Discourse

ACL 2006 | ACL 2007 | Unsegmented Multi-party Discourse | Unsupervised Segmentation-only Methods | Unsupervised Topic |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers