Sciweavers

ICASSP
2009
IEEE

A speech fragment approach to localising multiple speakers in reverberant environments

14 years 5 months ago
A speech fragment approach to localising multiple speakers in reverberant environments
Sound source localisation cues are severely degraded when multiple acoustic sources are active in the presence of reverberation. We present a binaural system for localising simultaneous speakers which exploits the fact that in a speech mixture there exist spectro-temporal regions or ‘fragments’, where the energy is dominated by just one of the speakers. A fragment-level localisation model is proposed that integrates the localisation cues within a fragment using a weighted mean. The weights are based on local estimates of the degree of reverberation in a given spectro-temporal cell. The paper investigates different weight estimation approaches based variously on, i) an established model of the perceptual precedence effect; ii) a measure of interaural coherence between the left and right ear signals; iii) a data-driven approach trained in matched acoustic conditions. Experiments with reverberant binaural data with two simultaneous speakers show appropriate weighting can improve fram...
Heidi Christensen, Ning Ma, Stuart N. Wrigley, Jon
Added 21 May 2010
Updated 21 May 2010
Type Conference
Year 2009
Where ICASSP
Authors Heidi Christensen, Ning Ma, Stuart N. Wrigley, Jon Barker
Comments (0)