Extension of Sparse, Adaptive Signal Decompositions to Semi-blind Audio Source Separation

16 years 2 months ago

Download www.elec.qmul.ac.uk

Abstract. We apply sparse, fast and ﬂexible adaptive lapped orthogonal transforms to underdetermined audio source separation using the time-frequency masking framework. This normally requires the sources to overlap as little as possible in the time-frequency plane. In this work, we apply our adaptive transform schemes to the semiblind case, in which the mixing system is already known, but the sources are unknown. By assuming that exactly two sources are active at each time-frequency index, we determine both the adaptive transforms and the estimated source coeﬃcients using 1 norm minimisation. We show average performance of 12–13 dB SDR on speech and music mixtures, and show that the adaptive transform scheme oﬀers improvements in the order of several tenths of a dB over transforms with constant block length. Comparison with previously studied upper bounds suggests that the potential for future improvements is signiﬁcant.

Andrew Nesbit, Emmanuel Vincent, Mark D. Plumbley

Real-time Traffic

Adaptive Transform | Adaptive Transform Scheme | Artificial Intelligence | IDA 2009 | Time-frequency Masking Framework |

claim paper

Post Info
More Details (n/a)

Added	26 May 2010
Updated	26 May 2010
Type	Conference
Year	2009
Where	IDA
Authors	Andrew Nesbit, Emmanuel Vincent, Mark D. Plumbley

Comments (0)

Sciweavers

Extension of Sparse, Adaptive Signal Decompositions to Semi-blind Audio Source Separation

Adaptive Transform | Adaptive Transform Scheme | Artificial Intelligence | IDA 2009 | Time-frequency Masking Framework |

Explore & Download

Productivity Tools

Sciweavers