Probabalistic Models and Informative Subspaces for Audiovisual Correspondence

15 years 3 months ago

Download groups.csail.mit.edu

Abstract. We propose a probabalistic model of single source multimodal generation and show how algorithms for maximizing mutual information can find the correspondences between components of each signal. We show how non-parametric techniques for finding informative subspaces can capture the complex statistical relationship between signals in different modalities. We extend a previous technique for finding informative subspaces to include new priors on the projection weights, yielding more robust results. Applied to human speakers, our model can find the relationship between audio speech and video of facial motion, and partially segment out background events in both channels. We present new results on the problem of audio-visual verification, and show how the audio and video of a speaker can be matched even when no prior model of the speaker's voice or appearance is available.

John W. Fisher III, Trevor Darrell

Real-time Traffic

Complex Statistical Relationship | Computer Vision | ECCV 2002 | Informative Subspaces | Prior Model | Probabalistic Model | Source Multimodal Generation |

claim paper

Post Info
More Details (n/a)

Added	16 Oct 2009
Updated	16 Oct 2009
Type	Conference
Year	2002
Where	ECCV
Authors	John W. Fisher III, Trevor Darrell

Comments (0)

Sciweavers

Probabalistic Models and Informative Subspaces for Audiovisual Correspondence

Complex Statistical Relationship | Computer Vision | ECCV 2002 | Informative Subspaces | Prior Model | Probabalistic Model | Source Multimodal Generation |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers