The IBM RT07 Evaluation Systems for Speaker Diarization on Lecture Meetings

16 years 1 months ago

Download www.ics.forth.gr

We present the IBM systems for the Rich Transcription 2007 (RT07) speaker diarization evaluation task on lecture meeting data. We ﬁrst overview our baseline system that was developed last year, as part of our speech-to-text system for the RT06s evaluation. We then present a number of simple schemes considered this year in our eﬀort to improve speaker diarization performance, namely: (i) A better speech activity detection (SAD) system, a necessary pre-processing step to speaker diarization; (ii) Use of word information from a speaker-independent speech recognizer; (iii) Modiﬁcations to speaker cluster merging criteria and the underlying segment model; and (iv) Use of speaker models based on Gaussian mixture models, and their iterative reﬁnement by frame-level re-labeling and smoothing of decision likelihoods. We report development experiments on the RT06s evaluation test set that demonstrate that these methods are eﬀective, resulting in dramatic performance improvements over o...

Jing Huang, Etienne Marcheret, Karthik Visweswaria

Real-time Traffic

Biometrics | CLEAR 2007 | Speaker Cluster | Speaker Diarization | Speaker Error |

claim paper

» The LIA RT07 Speaker Diarization System

» Multistage Speaker Diarization for Conference and Lecture Meetings

» The ICSI RT07s Speaker Diarization System

» The SRIICSI Spring 2007 Meeting and Lecture Recognition System

» The Rich Transcription 2007 Meeting Recognition Evaluation

Post Info
More Details (n/a)

Added	07 Jun 2010
Updated	07 Jun 2010
Type	Conference
Year	2007
Where	CLEAR
Authors	Jing Huang, Etienne Marcheret, Karthik Visweswariah, Gerasimos Potamianos

Comments (0)

Sciweavers

The IBM RT07 Evaluation Systems for Speaker Diarization on Lecture Meetings

Biometrics | CLEAR 2007 | Speaker Cluster | Speaker Diarization | Speaker Error |

Explore & Download

Productivity Tools

Sciweavers