Integrating binaural cues and blind source separation method for separating reverberant speech mixtures

13 years 11 months ago

Download mirlab.org

This paper presents a new method for reverberant speech separation, based on the combination of binaural cues and blind source separation (BSS) for the automatic classiﬁcation of the time-frequency (T-F) units of the speech mixture spectrogram. The main idea is to model interaural phase difference, interaural level difference and frequency bin-wise mixing vectors by Gaussian mixture models for each source and then evaluate that model at each T-F point and assign the units with high probability to that source. The model parameters and the assigned regions are reﬁned iteratively using the Expectation-Maximization (EM) algorithm. The proposed method also addresses the permutation problem of the frequency domain BSS by initializing the mixing vectors for each frequency channel. The EM algorithm starts with binaural cues and after a few iterations the estimated probabilistic mask is used to initialize and re-estimate the mixing vector model parameters. We performed experiments on speec...

Atiyeh Alinaghi, Wenwu Wang, Philip J. B. Jackson

Real-time Traffic

Binaural Cues | ICASSP 2011 | Mixing Vector | Signal Processing | Speech Mixtures |

claim paper

Post Info
More Details (n/a)

Added	20 Aug 2011
Updated	20 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Atiyeh Alinaghi, Wenwu Wang, Philip J. B. Jackson

Comments (0)

Sciweavers

Integrating binaural cues and blind source separation method for separating reverberant speech mixtures

Binaural Cues | ICASSP 2011 | Mixing Vector | Signal Processing | Speech Mixtures |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers