Sciweavers

ICASSP
2007
IEEE

Unsupervised Audio Segmentation using Extended Baum-Welch Transformations

14 years 6 months ago
Unsupervised Audio Segmentation using Extended Baum-Welch Transformations
Audio segmentation has applications in a variety of contexts, such as audio information retrieval, automatic sound analysis, and as a pre-processing step in speech recognition. Extended Baum-Welch (EBW) transformations are most commonly used as a discriminative technique for estimating parameters of Gaussian mixtures. In this paper, we derive an unsupervised audio segmentation approach using these transformations. We nd that our algorithm outperforms both the Bayesian Information Criterion (BIC) and Cumulative Sum (CUSUM) segmentation methods. In particular, our EBW segmentation algorithm provides improvements over the baseline approaches in detecting landmarks of short duration and minimizing landmark oversegmentation. In addition, we show that the EBW approach provides faster computation compared to the baseline methods.
Tara N. Sainath, Dimitri Kanevsky, Giridharan Iyen
Added 02 Jun 2010
Updated 02 Jun 2010
Type Conference
Year 2007
Where ICASSP
Authors Tara N. Sainath, Dimitri Kanevsky, Giridharan Iyengar
Comments (0)