Activity recognition in video is dominated by low- and mid-level features, and while demonstrably capable, by nature, these features carry little semantic meaning. Inspired by the...
In this paper we develop an algorithm for action recognition and localization in videos. The algorithm uses a figurecentric visual word representation. Different from previous ap...