Hierarchical audio classification using cepstral modulation ratio regressions based on Legendre polynomials

13 years 12 months ago

Download mirlab.org

In this work we present a scalable feature set which is obtained by ﬁtting orthogonal polynomials to the normalized modulation spectrum of cepstral coefﬁcients and which can be easily adapted to different classiﬁcation tasks. The performance of the feature set is investigated in a hierarchically structured audio signal classiﬁcation experiment and compared with other approaches reported in the literature. For the root categories speech, music and noise a classiﬁcation accuracy of 95% is achieved. Subclasses such as male and female speech or different noise types are classiﬁed with an accuracy of 95% and 85%, respectively. In a 10-category musical genre discrimination experiment the proposed features exhibit an accuracy of 61%.

Anil M. Nagathil, Peter Gottel, Rainer Martin

Real-time Traffic

10-category Musical Genre | ICASSP 2011 | Normalized Modulation Spectrum | Root Categories Speech | Signal Processing |

claim paper

Post Info
More Details (n/a)

Added	20 Aug 2011
Updated	20 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Anil M. Nagathil, Peter Gottel, Rainer Martin

Comments (0)

Sciweavers

Hierarchical audio classification using cepstral modulation ratio regressions based on Legendre polynomials

10-category Musical Genre | ICASSP 2011 | Normalized Modulation Spectrum | Root Categories Speech | Signal Processing |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers