Video-based recognition and prediction of a temporally extended activity can benefit from a detailed description of high-level expectations about the activity. Stochastic grammars allow for an efficient representation of such expectations and are well-suited for the specification of temporally well-ordered activities. In this paper, we extend stochastic grammars by adding event parameters, state checks, and sensitivity to an internal scene model. We present an implemented system that uses human-specified grammars to recognize a person performing the Towers of Hanoi task from a video sequence by analyzing object interaction events. Experimental results from several videos show robust recognition of the full task and its constituent sub-tasks even though no appearance models of the objects in the video are provided. These experiments include videos of the task performed with differently shaped objects and with distracting and extraneous interactions.
David Minnen, Irfan A. Essa, Thad Starner
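
To make the abstract's extensions concrete, the following is a minimal Python sketch, not the paper's implementation: it assumes a toy internal scene model for Towers of Hanoi, and all names (SceneModel, legal_move, parse_events, the p_move weight) are hypothetical illustrations. It shows how a parameterized event can be validated with a state check against the scene model before its stochastic production weight is applied.

```python
# Hedged illustration: parameterized events + state checks over an
# internal scene model, in the spirit of the extensions the abstract
# describes. All identifiers here are invented for this sketch.

from dataclasses import dataclass, field


@dataclass
class SceneModel:
    """Internal scene model: each peg holds a stack of disk sizes."""
    pegs: dict = field(default_factory=lambda: {"A": [3, 2, 1], "B": [], "C": []})

    def legal_move(self, src: str, dst: str) -> bool:
        # State check: the source peg must be non-empty, and the moved
        # disk must be smaller than the top disk on the destination peg.
        if not self.pegs[src]:
            return False
        return not self.pegs[dst] or self.pegs[src][-1] < self.pegs[dst][-1]

    def apply_move(self, src: str, dst: str) -> None:
        # Update the scene model to reflect the observed event.
        self.pegs[dst].append(self.pegs[src].pop())


def parse_events(events, scene, p_move=0.9):
    """Score a sequence of parameterized move(src, dst) events.

    Events that fail the state check are rejected (probability 0);
    otherwise the stochastic production weights are multiplied.
    """
    prob = 1.0
    for src, dst in events:
        if not scene.legal_move(src, dst):  # state check against the scene model
            return 0.0
        scene.apply_move(src, dst)          # keep the internal model in sync
        prob *= p_move                      # stochastic production weight
    return prob


if __name__ == "__main__":
    # Optimal 7-move solution for three disks, peg A to peg C.
    solution = [("A", "C"), ("A", "B"), ("C", "B"),
                ("A", "C"), ("B", "A"), ("B", "C"), ("A", "C")]
    print(parse_events(solution, SceneModel()))  # non-zero: sequence accepted
```

In this sketch the state check is what gives the grammar its sensitivity to the scene model: event sequences that contradict the modeled state are pruned outright rather than parsed with low probability.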