This paper presents a method for automatically annotating and retrieving animal images. Our model is a multi-modality ontology extended from our previous works in the sense that b...
A well-built dataset is a necessary starting point for advanced computer vision research. It plays a crucial role in evaluation and provides a continuous challenge to stateof-the-...
Abstract. The complexity of visual representations is substantially limited by the compositional nature of our visual world which, therefore, renders learning structured object mod...
Appropriate datasets are required at all stages of object recognition research, including learning visual models of object and scene categories, detecting and localizing instances ...
Jean Ponce, Tamara L. Berg, Mark Everingham, David...
In this paper, we propose a multimodal system for detecting human activity and interaction patterns in a nursing home. Activities of groups of people are firstly treated as intera...