Sciweavers

MM
2015
ACM

Learning Combinations of Multiple Feature Representations for Music Emotion Prediction

8 years 7 months ago
Learning Combinations of Multiple Feature Representations for Music Emotion Prediction
Music consists of several structures and patterns evolving through time which greatly influences the human decoding of higher-level cognitive aspects of music like the emotions expressed in music. For tasks, such as genre, tag and emotion recognition, these structures have often been identified and used as individual and non-temporal features and representations. In this work, we address the hypothesis whether using multiple temporal and non-temporal representations of different features is beneficial for modeling music structure with the aim to predict the emotions expressed in music. We test this hypothesis by representing temporal and non-temporal structures using generative models of multiple audio features. The representations are used in a discriminative setting via the Product Probability Kernel and the Gaussian Process model enabling Multiple Kernel Learning, finding optimized combinations of both features and temporal/ non-temporal representations. We show the increased p...
Jens Madsen, Bjørn Sand Jensen, Jan Larsen
Added 14 Apr 2016
Updated 14 Apr 2016
Type Journal
Year 2015
Where MM
Authors Jens Madsen, Bjørn Sand Jensen, Jan Larsen
Comments (0)