CVPR
2007
IEEE

Harmony in Motion

Cross-modal analysis offers information beyond that extracted from individual modalities. Consider a camcorder with a single microphone at a cocktail party: it captures several moving visual objects that emit sounds. A task for audio-visual analysis is to identify the number of independent audio-associated visual objects (AVOs), pinpoint the AVOs' spatial locations in the video, and isolate each corresponding audio component. Some of these problems were considered by prior studies, which were limited to simple cases, e.g., a single AVO or stationary sounds. We describe an approach that seeks to overcome these challenges. It acknowledges the importance of temporal features that are based on significant changes in each modality. A probabilistic formalism identifies temporal coincidences between these features, yielding cross-modal association and visual localization. This association is of particular benefit for harmonic sounds, as it enables subsequent isolation of each audio sou...
Zohar Barzelay, Yoav Y. Schechner
Added 12 Oct 2009
Updated 12 Oct 2009
Type Conference
Year 2007
Where CVPR