A speech fragment approach to localising multiple speakers in reverberant environments

14 years 9 months ago

Download perception.inrialpes.fr

Sound source localisation cues are severely degraded when multiple acoustic sources are active in the presence of reverberation. We present a binaural system for localising simultaneous speakers which exploits the fact that in a speech mixture there exist spectro-temporal regions or ‘fragments’, where the energy is dominated by just one of the speakers. A fragment-level localisation model is proposed that integrates the localisation cues within a fragment using a weighted mean. The weights are based on local estimates of the degree of reverberation in a given spectro-temporal cell. The paper investigates different weight estimation approaches based variously on, i) an established model of the perceptual precedence effect; ii) a measure of interaural coherence between the left and right ear signals; iii) a data-driven approach trained in matched acoustic conditions. Experiments with reverberant binaural data with two simultaneous speakers show appropriate weighting can improve fram...

Heidi Christensen, Ning Ma, Stuart N. Wrigley, Jon

Real-time Traffic

ICASSP 2009 | Localisation Cues | Signal Processing | Simultaneous Speakers | Source Localisation Cues |

claim paper

Post Info
More Details (n/a)

Added	21 May 2010
Updated	21 May 2010
Type	Conference
Year	2009
Where	ICASSP
Authors	Heidi Christensen, Ning Ma, Stuart N. Wrigley, Jon Barker

Comments (0)

Sciweavers

A speech fragment approach to localising multiple speakers in reverberant environments

ICASSP 2009 | Localisation Cues | Signal Processing | Simultaneous Speakers | Source Localisation Cues |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers