Improving acoustic event detection using generalizable visual features and multi-modality modeling

14 years 6 months ago

Download mirlab.org

Acoustic event detection (AED) aims to identify both timestamps and types of multiple events and has been found to be very challenging. The cues for these events often times exist in both audio and vision, but not necessarily in a synchronized fashion. We study improving the detection and classiﬁcation of the events using cues from both modalities. We propose optical ﬂow based spatial pyramid histograms as a generalizable visual representation that does not require training on labeled video data. Hidden Markov models (HMMs) are used for audio-only modeling, and multi-stream HMMs or coupled HMMs (CHMM) are used for audio-visual joint modeling. To allow the ﬂexibility of audio-visual state asynchrony, we explore effective CHMM training via HMM state-space mapping, parameter tying and different initialization schemes. The proposed methods successfully improve acoustic event classiﬁcation and detection on a multimedia meeting room dataset containing eleven types of general non-spe...

Po-Sen Huang, Xiaodan Zhuang, Mark Hasegawa-Johnso

Real-time Traffic

Acoustic Event | Acoustic Event Detection | Audio-visual Joint Modeling | ICASSP 2011 | Signal Processing |

claim paper

» Classification of video events using 4dimensional timecompressed motion features

» Audiovisual sports highlights extraction using Coupled Hidden Markov Models

» Efficient Sparse 3D Reconstruction by Space Sweeping

» Localizing volumetric motion for action recognition in realistic videos

Post Info
More Details (n/a)

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Po-Sen Huang, Xiaodan Zhuang, Mark Hasegawa-Johnson

Comments (0)

Sciweavers

Improving acoustic event detection using generalizable visual features and multi-modality modeling

Acoustic Event | Acoustic Event Detection | Audio-visual Joint Modeling | ICASSP 2011 | Signal Processing |

Explore & Download

Productivity Tools

Sciweavers