This paper describes a system which uses multiple visual processes to detect and track faces for video compression and transmission. The system is based on an architecture in whic...
We propose a graph-based method to improve the performance of person queries in large news video collections. The method benefits from the multi-modal structure of videos...
A computer vision system for tracking multiple people in relatively unconstrained environments is described. Tracking is performed at three levels of abstraction: regions, people and grou...
Stephen J. McKenna, Sumer Jabri, Zoran Duric, Harr...
Our goal is to segment multiple interacting and deforming agents in a video. Detectors often fail under large body deformation or agent entanglement. On the other hand, segmenting...
This paper describes a new approach to combine multiple modalities and applies it to the problem of affect recognition. The problem is posed as a combination of classifiers in a p...