A non-negative approach to semi-supervised separation of speech from noise with the use of temporal dynamics

13 years 7 months ago

Download mirlab.org

We present a semi-supervised source separation methodology to denoise speech by modeling speech as one source and noise as the other source. We model speech using the recently proposed non-negative hidden Markov model, which uses multiple non-negative dictionaries and a Markov chain to jointly model spectral structure and temporal dynamics of speech. We perform separation of the speech and noise using the recently proposed non-negative factorial hidden Markov model. Although the speech model is learned from training data, the noise model is learned during the separation process and requires no training data. We show that the proposed method achieves superior results to using non-negative spectrogram factorization, which ignores the non-stationarity and temporal dynamics of speech.

Gautham J. Mysore, Paris Smaragdis

Real-time Traffic

Hidden Markov Model | ICASSP 2011 | Non-negative | Non-negative Hidden Markov | Signal Processing |

claim paper

Post Info
More Details (n/a)

Added	20 Aug 2011
Updated	20 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Gautham J. Mysore, Paris Smaragdis

Comments (0)

Sciweavers

A non-negative approach to semi-supervised separation of speech from noise with the use of temporal dynamics

Hidden Markov Model | ICASSP 2011 | Non-negative | Non-negative Hidden Markov | Signal Processing |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers