Semantic Annotation and Retrieval of Music and Sound Effects

14 years 22 days ago

Download cosmal.ucsd.edu

We present a computer audition system that can both annotate novel audio tracks with semantically meaningful words and retrieve relevant tracks from a database of unlabeled audio content given a text-based query. We consider the related tasks of content-based audio annotation and retrieval as one supervised multiclass, multilabel problem in which we model the joint probability of acoustic features and words. We collect a data set of 1700 human-generated annotations that describe 500 Western popular music tracks. For each word in a vocabulary, we use this data to train a Gaussian mixture model (GMM) over an audio feature space. We estimate the parameters of the model using the weighted mixture hierarchies expectation maximization algorithm. This algorithm is more scalable to large data sets and produces better density estimates than standard parameter estimation techniques. The quality of the music annotations produced by our system is comparable with the performance of humans on the sa...

Douglas Turnbull, Luke Barrington, D. Torres, Gert

Real-time Traffic

Content-based Audio Annotation | Gaussian Mixture Model | TASLP 2008 | Weighted Mixture Hierarchies |

claim paper

Post Info
More Details (n/a)

Added	15 Dec 2010
Updated	15 Dec 2010
Type	Journal
Year	2008
Where	TASLP
Authors	Douglas Turnbull, Luke Barrington, D. Torres, Gert R. G. Lanckriet

Comments (0)

Sciweavers

Semantic Annotation and Retrieval of Music and Sound Effects

Content-based Audio Annotation | Gaussian Mixture Model | TASLP 2008 | Weighted Mixture Hierarchies |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers