Phoneme selective speech enhancement using the generalized parametric spectral subtraction estimator

13 years 6 months ago

Download mirlab.org

In this study, the generalized parametric spectral subtraction estimator is employed in the context of a ROVER speech enhancement framework to develop a robust phoneme class selective enhancement algorithm. The parametric estimator is derived by a) optimizing the weighted Euclidean distortion cost function and b) by modeling clean speech spectral magnitudes as Rayleigh distributed priors. A set of enhanced utterances are generated from a single noisy utterance by tuning the parameters of the parametric estimator for different phoneme classes. The speech and non-speech segments are segregated using a voice activity detector. Thereafter, the mixture maximum model is used to make soft decisions on these segments to determine their phoneme class weights. The segments from the enhanced utterances are weighted by these decisions and combined to form the ﬁnal composite utterance. Using segmental SNR and Itakura-Saito metrics over two noise types and four SNR levels, it was demonstrated tha...

Amit Das, John H. L. Hansen

Real-time Traffic

ICASSP 2011 | Parametric Estimator | Phoneme Class | Signal Processing | Utterances |

claim paper

Post Info
More Details (n/a)

Added	20 Aug 2011
Updated	20 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Amit Das, John H. L. Hansen

Comments (0)

Sciweavers

Phoneme selective speech enhancement using the generalized parametric spectral subtraction estimator

ICASSP 2011 | Parametric Estimator | Phoneme Class | Signal Processing | Utterances |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers