In this work we propose an approach to combine audio and video modalities for person tracking using graphical models. We demonstrate a principled and intuitive framework for combi...
Akash Kushal, Mandar Rahurkar, Fei-Fei Li 0002, Je...
Bayesian networks are an attractive modeling tool for human sensing, as they combine an intuitive graphical representation with ef?cient algorithms for inference and learning. Ear...
Tanzeem Choudhury, James M. Rehg, Vladimir Pavlovi...
People can understand complex auditory and visual information, often using one to disambiguate the other. Automated analysis, even at a lowlevel, faces severe challenges, includin...
John W. Fisher III, Trevor Darrell, William T. Fre...
Abstract - In this paper, we have developed not only a probabilistic sound localization system including VAD (Voice Activity Detection) component using three microphones but also a...
This paper presents the Athens Information Technology system for 3D person tracking and the obtained results in the CLEAR 2007 evaluations. The system utilizes audiovisual informat...