Improving Speaker Diarization by Cross EM Refinement

Download www.cecs.uci.edu

In this paper, we present a new speaker diarization system that improves the accuracy of traditional hierarchical clustering-based methods with little increase in computational cost. Our contributions are twofold. First, we add a preprocessing step called "local clustering" before the hierarchical clustering algorithm to merge very similar adjacent speech segments. Local clustering reduces the number of segments that the hierarchical clustering must process, which dramatically increases processing speed. Second, we perform a postprocessing step called "cross EM refinement" to purify the clusters generated by the hierarchical clustering. This algorithm is based on the ideas of cross-validation and the EM algorithm. Our experimental evaluations show that the proposed cross EM refinement approach reduces the speaker diarization error by up to 56%, with an average reduction of 22%, compared to the traditional hierarchical clustering method.
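The local clustering idea described in the abstract (merging very similar adjacent segments before the expensive hierarchical step) can be illustrated with a minimal sketch. The abstract does not state which similarity measure or merge rule the authors use, so everything here is an assumption for illustration: the segment representation, the `mean_vector` and `local_clustering` helpers, the Euclidean distance between segment means, and the `threshold` parameter are all hypothetical and are not taken from the paper.

```python
from math import dist  # Euclidean distance between two points (Python 3.8+)


def mean_vector(frames):
    """Average a non-empty list of equal-length feature vectors."""
    n = len(frames)
    return [sum(col) / n for col in zip(*frames)]


def local_clustering(segments, threshold):
    """Merge each segment into its predecessor when the two are similar.

    segments  -- time-ordered list of segments; each segment is a list of
                 feature vectors (hypothetical representation)
    threshold -- hypothetical merge threshold on the distance between
                 segment means (an assumed measure, not the paper's)
    """
    merged = []
    for seg in segments:
        if merged and dist(mean_vector(merged[-1]), mean_vector(seg)) < threshold:
            merged[-1] = merged[-1] + seg  # extend the running segment
        else:
            merged.append(list(seg))       # start a new segment
    return merged


# Toy data: four time-ordered segments with 2-D features.
segments = [
    [[0.0, 0.0], [0.2, 0.0]],   # speaker A
    [[0.1, 0.1]],               # speaker A, adjacent and similar
    [[5.0, 5.0], [5.2, 4.8]],   # speaker B
    [[0.0, 0.1]],               # speaker A again, but not adjacent to A
]
reduced = local_clustering(segments, threshold=1.0)
print(len(segments), "->", len(reduced))
```

Only adjacent segments are compared, so the pass is linear in the number of segments. The last toy segment stays separate even though it resembles the first, because grouping non-adjacent segments is left to the later hierarchical clustering stage.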

Huazhong Ning, Wei Xu, Yihong Gong, Thomas S. Huang

Hierarchical Clustering | Hierarchical Clustering Algorithm | ICMCS 2006 | Local Clustering

Post Info

Added	11 Jun 2010
Updated	11 Jun 2010
Type	Conference
Year	2006
Where	ICMCS
Authors	Huazhong Ning, Wei Xu, Yihong Gong, Thomas S. Huang
