Sciweavers

MLMI
2005
Springer

The TNO Speaker Diarization System for NIST RT05s Meeting Data

14 years 6 months ago
The TNO Speaker Diarization System for NIST RT05s Meeting Data
The TNO speaker speaker diarization system is based on a standard BIC segmentation and clustering algorithm. Since for the NIST Rich Transcription speaker dizarization evaluation measure correct speech detection appears to be essential, we have developed a speech activity detector (SAD) as well. This is based on decoding the speech signal using two Gaussian Mixture Models trained on silence and speech. The SAD was trained on only AMI development test data, and performed quite well in the evaluation on all 5 meeting locations, with a SAD error rate of 5.0 %. For the speaker clustering algorithm we optimized the BIC penalty parameter λ to 14, which is quite high with respect to
David van Leeuwen
Added 28 Jun 2010
Updated 28 Jun 2010
Type Conference
Year 2005
Where MLMI
Authors David van Leeuwen
Comments (0)