Grouplet: a Structured Image Representation for Recognizing Human and Object Interactions

Psychologists have proposed that many human-object interaction activities form unique classes of scenes. Recognizing these scenes is important for many social functions. Enabling a computer to do this is, however, a challenging task. Take people-playing-musical-instrument (PPMI) as an example: distinguishing a person playing a violin from a person just holding one requires subtle distinctions between the characteristic image features and feature arrangements that differentiate the two scenes. Most existing image representation methods are either too coarse (e.g. bag-of-words, BoW) or too sparse (e.g. constellation models) for this task. In this paper, we propose a new image feature representation called “grouplet”. The grouplet captures the structured information of an image by encoding a number of discriminative visual features and their spatial configurations. Using a dataset of 7 different PPMI activities, we show that grouplets are more effective in classifying and detecting hu...

Bangpeng Yao, Li Fei-Fei

Computer Vision | CVPR 2010 | Human-object Interactions | Image Feature | Person Playing Violin |

Post Info

Added	03 Apr 2010
Updated	14 May 2010
Type	Conference
Year	2010
Where	CVPR
Authors	Bangpeng Yao, Li Fei-Fei
