In this paper we propose an attention-based vision system for the JAST interactive dialog robot. The robotic vision system incorporates three submodules: object recognition, gestu...
We present a higher-level visual representation, visual synset, for object categorization. The visual synset improves the traditional bag of words representation with better discr...
Yantao Zheng, Ming Zhao 0003, Shi-Yong Neo, Tat-Se...
A typical way to perform video annotation requires to classify video elements (e.g. events and objects) according to some pre-defined ontology of the video content domain. Ontolo...
In this paper, a novel genetically-inspired visual learning method is proposed. Given the training images, this general approach induces a sophisticated feature-based recognition s...
In this paper, we present our work for automatic generation of textual metadata based on visual content analysis of video news. We present two methods for semantic object detectio...