Minimum Mean-Squared Error Estimation of Mel-Frequency Cepstral Coefficients Using a Novel Distortion Model


Download povinelli.eece.mu.edu

In this paper, a new method for statistical estimation of Mel-frequency cepstral coefficients (MFCCs) in noisy speech signals is proposed. Previous research has shown that model-based feature domain enhancement of speech signals for use in robust speech recognition can improve recognition accuracy significantly. These methods, which typically work in the log spectral or cepstral domain, must face the high complexity of distortion models caused by the nonlinear interaction of speech and noise in these domains. In this paper, an additive cepstral distortion model (ACDM) is developed, and used with a minimum mean-squared error (MMSE) estimator for recovery of MFCC features corrupted by additive noise. The proposed ACDM-MMSE estimation algorithm is evaluated on the Aurora2 database, and is shown to provide significant improvement in word recognition accuracy over the baseline.
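As a rough illustration of the MMSE step only, the sketch below assumes the simplest additive model in the cepstral domain: the observed noisy feature is y = x + d, where the clean feature x and the distortion d are independent Gaussians with diagonal covariances. These are assumptions made for illustration. The abstract does not specify the form of the ACDM or the priors it uses, and the function name and parameters here are hypothetical. Under these assumptions the MMSE estimate E[x | y] has a closed form for each coefficient.

```python
def mmse_additive_diag(y, mu_x, var_x, mu_d, var_d):
    """MMSE estimate of clean features x from noisy features y = x + d.

    Assumes (for illustration only) that x ~ N(mu_x, diag(var_x)) and
    d ~ N(mu_d, diag(var_d)) are independent. All arguments are
    equal-length sequences of floats, one entry per cepstral coefficient.
    """
    estimate = []
    for y_i, mx, vx, md, vd in zip(y, mu_x, var_x, mu_d, var_d):
        # Gain in [0, 1]: how much of the observation to trust
        # relative to the clean-speech prior.
        gain = vx / (vx + vd)
        # Conditional mean E[x | y] for jointly Gaussian x and y.
        estimate.append(mx + gain * (y_i - mx - md))
    return estimate


# Two-coefficient example with equal prior and distortion variances,
# so each gain is 0.5.
x_hat = mmse_additive_diag(
    y=[3.0, 1.0],
    mu_x=[1.0, 0.0],
    var_x=[1.0, 2.0],
    mu_d=[0.0, 0.0],
    var_d=[1.0, 2.0],
)
print(x_hat)
```

When the distortion variance is small relative to the prior variance, the gain approaches 1 and the estimate approaches y minus the distortion mean. When it is large, the estimate falls back toward the prior mean. A practical system would typically replace the single Gaussian prior with a mixture model, which is outside what this sketch shows.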

Kevin M. Indrebo, Richard J. Povinelli, Michael T. Johnson


Cepstral | Distortion Model | Recognition Accuracy | TASLP 2008 |


Added	28 Jan 2011
Updated	28 Jan 2011
Type	Journal
Year	2008
Where	TASLP
Authors	Kevin M. Indrebo, Richard J. Povinelli, Michael T. Johnson
