Evaluating Source Separation Algorithms With Reverberant Speech

13 years 10 months ago

Download cns-web.bu.edu

This paper examines the performance of several source separation systems on a speech separation task for which human intelligibility has previously been measured. For anechoic mixtures, automatic speech recognition (ASR) performance on the separated signals is quite similar to human performance. In reverberation, however, while signal separation has some benefit for ASR, the results are still far below those of human listeners facing the same task. Performing this same experiment with a number of oracle masks created with a priori knowledge of the separated sources motivates a new objective measure of separation performance, the Direct-path, Early echo, and Reverberation, of the Target and Masker (DERTM), which is closely related to the ASR results. This measure indicates that while the non-oracle algorithms successfully reject the direct-path signal from the masking source, they reject less of its reverberation, explaining the disappointing ASR performance.

Michael I. Mandel, S. Bressler, Barbara G. Shinn-C

Real-time Traffic

Disappointing Asr Performance | Performance | Reverberation | Software Engineering | TASLP 2010 |

claim paper

Post Info
More Details (n/a)

Added	21 May 2011
Updated	21 May 2011
Type	Journal
Year	2010
Where	TASLP
Authors	Michael I. Mandel, S. Bressler, Barbara G. Shinn-Cunningham, Daniel P. W. Ellis

Comments (0)

Sciweavers

Evaluating Source Separation Algorithms With Reverberant Speech

Disappointing Asr Performance | Performance | Reverberation | Software Engineering | TASLP 2010 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers