Learning Structured Appearance Models from Captioned Images of Cluttered Scenes

16 years 9 months ago

Download www.cs.toronto.edu

Given an unstructured collection of captioned images of cluttered scenes featuring a variety of objects, our goal is to learn both the names and appearances of the objects. Only a small number of local features within any given image are associated with a particular caption word. We describe a connected graph appearance model where vertices represent local features and edges encode spatial relationships. We use the repetition of feature neighborhoods across training images and a measure of correspondence with caption words to guide the search for meaningful feature configurations. We demonstrate improved results on a dataset to which an unstructured object model was previously applied. We also apply the new method to a more challenging collection of captioned images from the web, detecting and annotating objects within highly cluttered realistic scenes.

Michael Jamieson, Afsaneh Fazly, Sven J. Dickinson

Real-time Traffic

Cluttered Scenes | Computer Vision | Graph Appearance Model | ICCV 2007 | Meaningful Feature Configurations | Particular Caption Word | Unstructured Object Model |

claim paper

» Using Language to Learn Structured Appearance Models for Image Annotation

» D Clutter Building object model library from unsupervised segmentation of cluttered scene...

» Discriminative Learning with Latent Variables for Cluttered Indoor Scene Understanding

» Depth from Familiar Objects A Hierarchical Model for 3D Scenes

» Geometric Context from a Single Image

» Learning Hierarchical Models of Scenes Objects and Parts

» General object tracking with a componentbased target descriptor

» VisionBased 3D Object Localization Using Probabilistic Models of Appearance

Post Info
More Details (n/a)

Added	14 Oct 2009
Updated	14 Oct 2009
Type	Conference
Year	2007
Where	ICCV
Authors	Michael Jamieson, Afsaneh Fazly, Sven J. Dickinson, Suzanne Stevenson, Sven Wachsmuth

Comments (0)

Sciweavers

Learning Structured Appearance Models from Captioned Images of Cluttered Scenes

Cluttered Scenes | Computer Vision | Graph Appearance Model | ICCV 2007 | Meaningful Feature Configurations | Particular Caption Word | Unstructured Object Model |

Explore & Download

Productivity Tools

Sciweavers