Speech reading, also known as lip reading, extracts visual cues from lip and facial movements to aid speech recognition. The main hurdle for speech reading is th...
This paper exploits the context of natural dynamic scenes for human action recognition in video. Human actions are frequently constrained by the purpose and the physical propert...
Marcin Marszalek (INRIA), Ivan Laptev (INRIA), Cor...
This paper addresses human pose recognition from video sequences by formulating it as a classification problem. Unlike much previous work, we do not make any assumptions on the ava...
This paper presents and investigates a set of local space-time descriptors for representing and recognizing motion patterns in video. Following the idea of local features ...
We describe a framework for robot navigation that exploits the continuity of image sequences. Tracked visual features both guide the robot and provide predictive information about...