This paper proposes a new method for bimodal information fusion in audio-visual speech recognition, where cross-modal association is considered in two levels. First, the acoustic a...
Varying illumination and partial occlusion are two main difficulties in visual tracking. Existing methods based on appearance information cannot solve these problems effectively s...
Abstract. This paper presents a novel pixelwise representation for visual tracking that models both the spatial structure and dynamics of a target in a unified fashion. The represe...
Graphical virtual worlds are increasingly significant sites of collaborative interaction. Many argue that the simulation of the everyday environment makes them particularly effect...
We propose a visual recognition system that is designed for fine-grained visual categorization. The system is composed of a machine and a human user. The user, who is unable to c...
Catherine Wah, Steven Branson, Pietro Perona, Serg...