Sciweavers

ICASSP
2008
IEEE

Evaluation of several strategies for single sensor speech/music separation

14 years 6 months ago
Evaluation of several strategies for single sensor speech/music separation
In this paper we address the application of single sensor source separation techniques to mixtures of speech and music. Three strategies for source modeling are presented, namely Gaussian Scaled Mixture Models (GSMM), Autoregressive (AR) models and Amplitude Factor (AF). The common ingredient to the methods is the use of a codebook containing elementary spectral shapes to represent nonstationary signals, and to handle separately spectral shape and amplitude information. We propose a new system that employs separate models for the speech and music signals. The speech signal proves to be best modeled with the AR-based codebook, while the music signal is best modeled with the AF-based codebook. Experimental results demonstrate the improved performance of the proposed approach for speech/music separation in some evaluation criteria.
Raphaël Blouet, Guy Rapaport, Cédric F
Added 30 May 2010
Updated 30 May 2010
Type Conference
Year 2008
Where ICASSP
Authors Raphaël Blouet, Guy Rapaport, Cédric Févotte
Comments (0)