Sciweavers

ICPR
2006
IEEE

Audio-Visual Speaker Localization Using Graphical Models

15 years 28 days ago
Audio-Visual Speaker Localization Using Graphical Models
In this work we propose an approach to combine audio and video modalities for person tracking using graphical models. We demonstrate a principled and intuitive framework for combining these modalities to obtain robustness against occlusion and change in appearance. We further exploit the temporal correlations that exist for a moving object between adjacent frames to account for the cases where having both modalities might still not be enough, e.g., when the person being tracked is occluded and not speaking. Improvement in tracking results is shown at each step and compared with manually annotated ground truth.
Akash Kushal, Mandar Rahurkar, Fei-Fei Li 0002, Je
Added 09 Nov 2009
Updated 09 Nov 2009
Type Conference
Year 2006
Where ICPR
Authors Akash Kushal, Mandar Rahurkar, Fei-Fei Li 0002, Jean Ponce, Thomas S. Huang
Comments (0)