Sciweavers

CSL
2010
Springer

Speech separation using speaker-adapted eigenvoice speech models

14 years 20 days ago
Speech separation using speaker-adapted eigenvoice speech models
We present a system for model-based source separation for use on single channel speech mixtures where the precise source characteristics are not known a priori. The sources are modeled using hidden Markov models (HMM) and separated using factorial HMM methods. Without prior speaker models for the sources in the mixture it is difficult to exactly resolve the individual sources because there is no way to determine which state corresponds to which source at any point in time. This is solved to a small extent by the temporal constraints provided by the Markov models, but permutations between sources remains a significant problem. We overcome this by adapting the models to match the sources in the mixture. We do this by representing the space of speaker variation with a parametric signal model based on the eigenvoice technique for rapid speaker adaptation. We present an algorithm to infer the characteristics of the sources present in a mixture, allowing for significantly improved separatio...
Ron J. Weiss, Daniel P. W. Ellis
Added 09 Dec 2010
Updated 09 Dec 2010
Type Journal
Year 2010
Where CSL
Authors Ron J. Weiss, Daniel P. W. Ellis
Comments (0)