Shout detection in noise

For the task of detecting shouted speech in a noisy environment, this paper introduces a system based on mel frequency cepstral coefficient (MFCC) feature extraction, unsupervised frame dropping and Gaussian mixture model (GMM) classification. The evaluation material consists of phonemically identical speech and shouting as well as environmental noise of varying levels. The performance of the shout detection system is analyzed by varying the MFCC feature extraction with respect to 1) the feature vector length and 2) the spectrum estimation method. As for feature vector length, the best performance is obtained using 30 MFCC coefficients, which is more than what is conventionally used. In spectrum estimation, a scheme that combines a linear prediction spectrum envelope with spectral fine structure outperforms the conventional FFT.

Jouni Pohjalainen, Paavo Alku, Tomi Kinnunen
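
The GMM classification stage named in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' method: the abstract does not state the decision rule, so this sketch compares the average per-frame log-likelihood of two diagonal-covariance GMMs against a threshold, and the function names, the two-dimensional toy features and all model parameters are hypothetical. In the paper's setting the frames would be MFCC vectors that survived frame dropping.

```python
import math


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def gmm_log_likelihood(x, gmm):
    """Log-likelihood of x under a GMM given as (weight, mean, var) triples."""
    terms = [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]
    peak = max(terms)  # log-sum-exp keeps the sum numerically stable
    return peak + math.log(sum(math.exp(t - peak) for t in terms))


def is_shout(frames, shout_gmm, other_gmm, threshold=0.0):
    """Assumed decision rule: average log-likelihood ratio above a threshold."""
    llr = sum(
        gmm_log_likelihood(f, shout_gmm) - gmm_log_likelihood(f, other_gmm)
        for f in frames
    ) / len(frames)
    return llr > threshold


# Toy models with made-up parameters (not taken from the paper).
shout_gmm = [(0.5, [3.0, 3.0], [1.0, 1.0]), (0.5, [4.0, 2.0], [1.0, 1.0])]
other_gmm = [(1.0, [0.0, 0.0], [1.0, 1.0])]

print(is_shout([[3.2, 2.9], [3.8, 2.1]], shout_gmm, other_gmm))
print(is_shout([[0.1, -0.2], [0.3, 0.0]], shout_gmm, other_gmm))
```

Averaging over frames makes the score independent of segment length, which is one common choice; a real system would train both models from labelled data instead of hand-set parameters.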

Feature Vector Length | ICASSP 2011 | MFCC Feature Extraction | Signal Processing | Spectrum Estimation

Post Info

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Conference
Year	2011
Where	ICASSP
Authors	Jouni Pohjalainen, Paavo Alku, Tomi Kinnunen
