We present a method for video classification based on information in the soundtrack. Unlike previous approaches, which describe the audio via statistics of mel-frequency cepstral coefficient (MFCC) features calculated on uniformly spaced frames, we investigate an approach that focuses the representation on audio transients corresponding to soundtrack events. These event-related features can reflect the "foreground" of the soundtrack and capture its short-term temporal structure better than conventional frame-based statistics. We evaluate our method on a test set of 1873 YouTube videos labeled with 25 semantic concepts. Retrieval results based on transient features alone are comparable to an MFCC-based system, and fusing the two representations achieves a relative improvement of 7.5% in mean average precision (MAP).
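To make the contrast between the two representations concrete, the following is a minimal sketch, not the authors' implementation: it computes MFCC statistics over uniformly spaced frames (the baseline described above) and over onset-anchored frames only (a stand-in for the transient-focused representation). The librosa onset detector, the input filename, and the feature dimensions are illustrative assumptions.

\begin{verbatim}
import numpy as np
import librosa

# Hypothetical input soundtrack; sample rate is an arbitrary choice.
y, sr = librosa.load("soundtrack.wav", sr=22050)

# Baseline: MFCCs on uniformly spaced frames, summarized by
# per-coefficient mean and standard deviation.
mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13)  # shape (13, n_frames)
baseline_feat = np.concatenate([mfcc.mean(axis=1), mfcc.std(axis=1)])

# Transient-focused variant: keep only frames at detected onsets,
# so the summary reflects foreground events rather than the
# soundtrack as a whole. (A simple proxy for the paper's
# transient-event features, not the method itself.)
onsets = librosa.onset.onset_detect(y=y, sr=sr, units="frames")
event_mfcc = mfcc[:, onsets] if len(onsets) > 0 else mfcc
event_feat = np.concatenate([event_mfcc.mean(axis=1),
                             event_mfcc.std(axis=1)])

print(baseline_feat.shape, event_feat.shape)  # (26,) (26,)
\end{verbatim}

Both vectors could then feed the same classifier, and their scores combined for fusion, mirroring the evaluation protocol summarized above.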
Courtenay V. Cotton, Daniel P. W. Ellis, Alexander C. Loui