ICASSP
2010
IEEE

Jointly recognizing multi-speaker conversations

We suggest an approach to speech recognition in which the multiple sides of a conversation in a dialog or meeting are processed and decoded jointly rather than independently. We also introduce a practical implementation of this approach that demonstrates improvements in both language model perplexity and speech recognition word error rate on conversational telephone speech. Specifically, we show that such benefits can be had if an n-gram language model, in addition to conditioning on the immediately preceding words in an utterance, is also allowed to condition on the estimated dialog-act of the immediately preceding utterance of an alternate speaker.
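As a rough illustration of the conditioning idea in the abstract, the following is a minimal sketch of a bigram language model whose context includes the dialog act of the other speaker's preceding utterance. It is not the authors' implementation: the class name `DialogActBigramLM`, the add-k smoothing, the toy data, and the dialog-act labels `"question"` and `"statement"` are all assumptions made for illustration, and the dialog act is supplied as a given label rather than estimated.

```python
import math
from collections import defaultdict


class DialogActBigramLM:
    """Toy add-k bigram model: P(word | previous word, other speaker's dialog act).

    Hypothetical sketch; out-of-vocabulary words are not handled.
    """

    def __init__(self, k=1.0):
        self.k = k
        # counts[(prev_word, prev_dialog_act)][word] -> count
        self.counts = defaultdict(lambda: defaultdict(float))
        self.vocab = {"</s>"}

    def train(self, utterances):
        # Each item: (list of words, dialog act of the other speaker's previous utterance)
        for words, prev_da in utterances:
            prev = "<s>"
            for w in list(words) + ["</s>"]:
                self.counts[(prev, prev_da)][w] += 1
                self.vocab.add(w)
                prev = w

    def prob(self, word, prev_word, prev_da):
        ctx = self.counts.get((prev_word, prev_da), {})
        total = sum(ctx.values())
        return (ctx.get(word, 0.0) + self.k) / (total + self.k * len(self.vocab))

    def perplexity(self, utterances):
        log_sum, n = 0.0, 0
        for words, prev_da in utterances:
            prev = "<s>"
            for w in list(words) + ["</s>"]:
                log_sum += math.log(self.prob(w, prev, prev_da))
                n += 1
                prev = w
        return math.exp(-log_sum / n)


# Made-up training data: (utterance, dialog act of the other speaker's preceding utterance)
data = [
    (["yes", "i", "do"], "question"),
    (["no", "i", "dont"], "question"),
    (["i", "see"], "statement"),
    (["uh", "huh"], "statement"),
]
lm = DialogActBigramLM(k=1.0)
lm.train(data)

# "yes" is a more likely utterance opener after a question than after a statement.
p_after_question = lm.prob("yes", "<s>", "question")
p_after_statement = lm.prob("yes", "<s>", "statement")
```

In this toy setup the same word receives different probabilities depending on what the other speaker just did, which is the kind of cross-speaker dependency that decoding each side independently cannot exploit.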
Gang Ji, Jeff Bilmes
Added 06 Dec 2010
Updated 06 Dec 2010
Type Conference
Year 2010
Where ICASSP
Authors Gang Ji, Jeff Bilmes