Sciweavers

ICASSP
2010
IEEE

Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power floori

13 years 12 months ago
Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power floori
This paper presents a new robust feature extraction algorithm based on a modified approach to power bias subtraction combined with applying a threshold to the power spectral density. Power bias level is selected as a level above which the signal power distribution is sharpest. The sharpness is measured using the ratio of arithmetic mean to the geometric mean of medium-duration power. When subtracting this bias level, power flooring is applied to enhance robustness. These new ideas are employed to enhance our recently introduced feature extraction algorithm PNCC (Power Normalized Cepstral Coefficient). While simpler than our previous PNCC, experimental results show that this new PNCC is showing better performance than our previous implementation.
Chanwoo Kim, Richard M. Stern
Added 26 Jan 2011
Updated 26 Jan 2011
Type Journal
Year 2010
Where ICASSP
Authors Chanwoo Kim, Richard M. Stern
Comments (0)