This paper presents a multi-modal approach to locate a speaker in a scene and determine to whom he or she is speaking. We present a simple probabilistic framework that combines mu...
Michael Siracusa, Louis-Philippe Morency, Kevin Wi...
There is general consensus that context can be a rich source of information about an object's identity, location and scale. However, the issue of how to formalize contextual ...