Combining different and complementary object models promises to increase the robustness and generality of today’s computer vision algorithms. This paper introduces a new method ...
For scene classification, patch-level linear features do not always work as well as handcrafted features. In this paper, we present a new model to greatly improve the usefulness ...
Liwei Wang, Yin Li, Jiaya Jia, Jian Sun, David Wip...
We describe a mid-level approach for action recognition. From an input video, we extract salient spatio-temporal structures by forming clusters of trajectories that serve as candi...
We present a unified occlusion model for object instance detection under arbitrary viewpoint. Whereas previous approaches primarily modeled local coherency of occlusions or attem...
Despite significant recent progress, the best available visual saliency models still lag behind human performance in predicting eye fixations in free-viewing of natural scenes. ...