Brian Whitman, Ryan M. Rifkin

Abstract—We present the query-by-description (QBD) component of “Kandem,” a time-aware music retrieval system. The QBD system learns a relation between descriptive text about a musical artist and that artist’s actual acoustic output, making queries such as “Play me something loud with an electronic beat” possible purely by analyzing the audio content of a database. We describe a novel machine learning technique based on Regularized Least-Squares Classification (RLSC) that quickly and efficiently learns the non-linear relation between descriptive language and audio features by treating the problem as a large number of output classes tied to a shared set of input features. We also show how RLSC training readily eliminates irrelevant labels.
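Because every descriptive term is predicted from the same set of audio features, all per-term RLSC classifiers can be obtained from a single regularized kernel system. The following Python sketch illustrates that shared-kernel idea under stated assumptions: the Gaussian kernel, the regularization constant, and the random stand-in features and labels are illustrative choices of ours, not parameters or data from the paper.

```python
import numpy as np

def rbf_kernel(A, B, gamma=0.1):
    """Gaussian (RBF) Gram matrix between rows of A and rows of B."""
    sq = (A**2).sum(1)[:, None] + (B**2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * sq)

def train_rlsc(X, Y, lam=1e-3, gamma=0.1):
    """Fit one RLSC classifier per label column of Y.

    X : (n, d) audio feature vectors.
    Y : (n, k) +1/-1 label matrix, one column per descriptive term.
    Because every term shares the same Gram matrix K, a single solve of
    (K + lam*n*I) C = Y yields coefficients for all k terms at once.
    """
    n = X.shape[0]
    K = rbf_kernel(X, X, gamma)
    C = np.linalg.solve(K + lam * n * np.eye(n), Y)  # (n, k) coefficients
    return C

def predict_rlsc(C, X_train, X_test, gamma=0.1):
    """Real-valued scores for each test point and each descriptive term."""
    return rbf_kernel(X_test, X_train, gamma) @ C

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 20))           # stand-in audio features
    Y = np.sign(rng.normal(size=(200, 50)))  # stand-in term labels ("loud", ...)
    C = train_rlsc(X, Y)
    scores = predict_rlsc(C, X, X[:5])
    print(scores.shape)  # (5, 50): one score per query point per term
```

Terms whose scores carry no signal on held-out data (the “irrelevant labels” noted above) can simply have their columns dropped without retraining the rest, since each column of C is independent given the shared factorization.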