Improving the filter bank of a classic speech feature extraction algorithm

14 years 8 months ago

Download www.cnel.ufl.edu

The most popular speech feature extractor used in automatic speech recognition (ASR) systems today is the mel frequency cepstral coefﬁcient (mfcc) algorithm. Introduced in 1980, the ﬁlter bank-based algorithm eventually replaced linear prediction cepstral coefﬁcients (lpcc) as the premier front end, primarily because of mfcc’s superior robustness to additive noise. However, mfcc does not approximate the critical bandwidth of the human auditory system. We propose a novel scheme for decoupling ﬁlter bandwidth from other ﬁlter bank parameters, and we demonstrate improved noise robustness over three versions of mfcc through HMMbased experiments with the English digits in various noise environments.

Mark D. Skowronski, John G. Harris

Real-time Traffic

Cepstral Coefﬁcients | ISCAS 2003 | Mfcc’s Superior Robustness | Prediction Cepstral Coefﬁcients |

claim paper

Post Info
More Details (n/a)

Added	04 Jul 2010
Updated	04 Jul 2010
Type	Conference
Year	2003
Where	ISCAS
Authors	Mark D. Skowronski, John G. Harris

Comments (0)

Sciweavers

Improving the filter bank of a classic speech feature extraction algorithm

Cepstral Coefﬁcients | ISCAS 2003 | Mfcc’s Superior Robustness | Prediction Cepstral Coefﬁcients |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers