Navigation through large multimedia collections that include videos and images still remains a hard problem. In this paper, we introduce a novel method to visualize and navigate t...
We designed eSports—a collaborative and synchronous video annotation platform, which is to be used in Internet scale cross-platform grid computing environment to facilitate Comp...
Gang Zhai, Geoffrey Fox, Marlon E. Pierce, Wenjun ...
The spotting and recognition of the human gestures is a key task in automating the analysis of the video material and human-robot interaction. Specially, applying this technology ...
The development of mid-level concepts helps to bridge the gap between low-level feature and high-level semantics in video analysis. Most existing work combines the customized mid-...
Grounded language models represent the relationship between words and the non-linguistic context in which they are said. This paper describes how they are learned from large corpo...