An acoustically-motivated spatial prior for under-determined reverberant source separation

14 years 11 months ago

Download mirlab.org

We consider the task of under-determined reverberant audio source separation. We model the contribution of each source to all mixture channels in the time-frequency domain as a zero-mean Gaussian random vector with full-rank spatial covariance matrix. We introduce an inverse Wishart prior over the covariance matrices, whose mean is given by the theory of statistical room acoustics and whose variance is learned from training data. We then derive an Expectation-Maximization (EM) algorithm to estimate the model parameters in the Maximum A Posteriori (MAP) sense given prior knowledge about the microphone spacing and the source positions. This algorithm provides a principled solution to the well-known permutation problem and achieves better separation performance than other algorithms exploiting the same prior knowledge.

Ngoc Q. K. Duong, Emmanuel Vincent, Rémi Gr

Real-time Traffic

Full-rank Spatial Covariance | Gaussian Random Vector | ICASSP 2011 | Reverberant Audio Source | Signal Processing |

claim paper

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Ngoc Q. K. Duong, Emmanuel Vincent, Rémi Gribonval

Sciweavers

An acoustically-motivated spatial prior for under-determined reverberant source separation

Full-rank Spatial Covariance | Gaussian Random Vector | ICASSP 2011 | Reverberant Audio Source | Signal Processing |

Explore & Download

Productivity Tools

Sciweavers