Sciweavers

CLEAR
2007
Springer

Multi-stage Speaker Diarization for Conference and Lecture Meetings

This paper presents the LIMSI RT-07S speaker diarization system for conference and lecture meetings. The system builds upon the RT06S diarization system designed for lecture data. The baseline system combines agglomerative clustering based on the Bayesian information criterion (BIC) with a second clustering stage using state-of-the-art speaker identification (SID) techniques. Since the baseline system yields a high speech activity detection (SAD) error of around 10% on lecture data, several acoustic representations with various normalization techniques are investigated within the framework of a log-likelihood ratio (LLR) based speech activity detector. Universal background models (UBMs) trained on the different types of acoustic features are also examined in the SID clustering stage. All SAD acoustic models and UBMs are trained using forced-alignment segmentations of the conference data. The diarization system integrating the new SAD models and UBM gives comparable results on both the RT-07S conference ...
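The abstract names BIC-based agglomerative clustering but gives no formulas. As a rough illustration only, the sketch below shows one common form of the delta-BIC merge test between two clusters of feature frames. It assumes a single diagonal-covariance Gaussian per cluster, and the names `diag_log_det`, `delta_bic`, and `penalty_weight` are hypothetical. It is not the LIMSI system's implementation.

```python
import math


def diag_log_det(frames):
    """Log-determinant of the diagonal ML covariance of a list of frames."""
    n = len(frames)
    dim = len(frames[0])
    total = 0.0
    for d in range(dim):
        mean = sum(f[d] for f in frames) / n
        var = sum((f[d] - mean) ** 2 for f in frames) / n
        total += math.log(max(var, 1e-8))  # floor avoids log(0)
    return total


def delta_bic(x, y, penalty_weight=1.0):
    """Delta-BIC between modelling x and y separately versus jointly.

    Assumed convention: a negative value favours merging the two clusters,
    a positive value favours keeping them apart.
    """
    n_x, n_y = len(x), len(y)
    n = n_x + n_y
    dim = len(x[0])
    # Extra parameters of the two-model hypothesis: one mean and one
    # variance per dimension (diagonal-covariance assumption).
    n_params = 2 * dim
    gain = 0.5 * (
        n * diag_log_det(x + y)
        - n_x * diag_log_det(x)
        - n_y * diag_log_det(y)
    )
    penalty = 0.5 * penalty_weight * n_params * math.log(n)
    return gain - penalty


# Toy data (made up for illustration): two 2-dimensional clusters.
cluster_a = [(i % 5 - 2, i % 3 - 1) for i in range(60)]
cluster_b = [(a + 10, b + 10) for a, b in cluster_a]

same = delta_bic(cluster_a, cluster_a)      # identical statistics
different = delta_bic(cluster_a, cluster_b) # well-separated means
print(same < 0, different > 0)
```

In an agglomerative loop under these assumptions, the pair of clusters with the lowest delta-BIC would be merged repeatedly until no pair scores below zero.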
Added 07 Jun 2010
Updated 07 Jun 2010
Type Conference
Year 2007
Where CLEAR
Authors Xuan Zhu, Claude Barras, Lori Lamel, Jean-Luc Gauvain