Sciweavers

ISMIR
2005
Springer

Finding An Optimal Segmentation for Audio Genre Classification

14 years 5 months ago
Finding An Optimal Segmentation for Audio Genre Classification
In the automatic classification of music many different segmentations of the audio signal have been used to calculate features. These include individual short frames (23 ms), longer frames (200 ms), short sliding textural windows (1 sec) of a stream of 23 ms frames, large fixed windows (10 sec) and whole files. In this work we present an evaluation of these different segmentations, showing that they are sub-optimal for genre classification and introduce the use of an onset detection based segmentation, which appears to outperform all of the fixed and sliding windows segmentation schemes in terms of classification accuracy and model size.
Kris West, Stephen Cox
Added 27 Jun 2010
Updated 27 Jun 2010
Type Conference
Year 2005
Where ISMIR
Authors Kris West, Stephen Cox
Comments (0)