Abstract— People in densely populated environments typically form groups that split and merge. In this paper we track groups of people so as to reflect this formation process an...
We present an experimental setup for real-time face identification in a cluttered scene. Color images of people are recorded with a static camera. A rough face detection is perfor...
Spatial language video retrieval is an important real-world problem that forms a test bed for evaluating semantic structures for natural language descriptions of motion on natural...
User feedback is widely deployed in recent multimedia research to refine retrieval performance. However, most of the existing online learning algorithms handle interactions of a s...
We present a 3-level hierarchical model for localizing human bodies in still images from arbitrary viewpoints. We first fit a simple tree-structured model defined on a small landm...
Jiayong Zhang, Jiebo Luo, Robert T. Collins, Yanxi...