


MLMI
2007
Springer


Modeling Vocal Interaction for Segmentation in Meeting Recognition


Download www.cs.cmu.edu

Automatic segmentation is an important technology for both automatic speech recognition and automatic speech understanding. In meetings, participants typically vocalize for only a fraction of the recorded time, but standard vocal activity detection algorithms for close-talk microphones in meetings continue to treat participants independently. In this work we present a multispeaker segmentation system which models a particular aspect of human-human communication, that of vocal interaction or the interdependence between participants’ on-off speech patterns. We describe our vocal interaction model, its training, and its use during vocal activity decoding. Our experiments show that this approach almost completely eliminates the problem of crosstalk, and word error rates on our development set are lower than those obtained with human-generated reference segmentation. We also observe significant performance improvements on unseen data.

Kornel Laskowski, Tanja Schultz


Automatic Speech | Machine Learning | MLMI 2007 | Vocal Activity | Vocal Interaction


Related Content

» Multimodal Integration for Meeting Group Action Segmentation and Recognition

» Contrasting emotionbearing laughter types in multiparticipant vocal activity detection for...

» Modeling Naturalistic Affective States Via Facial Vocal and Bodily Expressions Recognition

» Recognition of Dialogue Acts in Multiparty Meetings Using a Switching DBN

» Impact of automatic sentence segmentation on meeting summarization

» The SRI NIST 2010 speaker recognition evaluation system

» A MultiModal MixedState Dynamic Bayesian Network for Robust Meeting Event Recognition from...

» Hidden Conditional Random Fields for Meeting Segmentation

» Multistream Dynamic Bayesian Network for Meeting Segmentation

Post Info

Added	08 Jun 2010
Updated	08 Jun 2010
Type	Conference
Year	2007
Where	MLMI
Authors	Kornel Laskowski, Tanja Schultz
