Sciweavers

ICMCS
2005
IEEE

Gaussian Mixture Modeling Using Short Time Fourier Transform Features for Audio Fingerprinting

14 years 5 months ago
Gaussian Mixture Modeling Using Short Time Fourier Transform Features for Audio Fingerprinting
In audio fingerprinting, an audio clip must be recognized by matching an extracted fingerprint to a database of previously computed fingerprints. The fingerprints should reduce the dimensionality of the input significantly, provide discrimination among different audio clips, and at the same time, invariant to the distorted versions of the same audio clip. In this paper, we design fingerprints addressing the above issues by modeling an audio clip by Gaussian mixture models (GMM) using a wide range of easy-to-compute short time Fourier transform features such as Shannon entropy, Renyi entropy, spectral centroid, spectral bandwidth, spectral flatness measure, spectral crest factor, and Mel-frequency cepstral coefficients. We test the robustness of the fingerprints under a large number of distortions. To make the system robust, we use some of the distorted versions of the audio for training. However, we show that the audio fingerprints modeled using GMM are not only robust to th...
Arunan Ramalingam, Sridhar Krishnan
Added 24 Jun 2010
Updated 24 Jun 2010
Type Conference
Year 2005
Where ICMCS
Authors Arunan Ramalingam, Sridhar Krishnan
Comments (0)