Grounding spatial prepositions for video search

16 years 1 months ago

Download web.media.mit.edu

Spatial language video retrieval is an important real-world problem that forms a test bed for evaluating semantic structures for natural language descriptions of motion on naturalistic data. Video search by natural language query requires that linguistic input be converted into structures that operate on video in order to ﬁnd clips that match a query. This paper describes a framework for grounding the meaning of spatial prepositions in video. We present a library of features that can be used to automatically classify a video clip based on whether it matches a natural language query. To evaluate these features, we collected a corpus of natural language descriptions about the motion of people in video clips. We characterize the language used in the corpus, and use it to train and test models for the meanings of the spatial prepositions “to,” “across,” “through,” “out,” “along,” “towards,” and “around.” The classiﬁers can be used to build a spatial languag...

Stefanie Tellex, Deb Roy

Real-time Traffic

Biometrics | ICMI 2009 | Language Video Retrieval | Natural Language | Spatial Language Video |

claim paper

Added	26 May 2010
Updated	26 May 2010
Type	Conference
Year	2009
Where	ICMI
Authors	Stefanie Tellex, Deb Roy

Sciweavers

Grounding spatial prepositions for video search

Biometrics | ICMI 2009 | Language Video Retrieval | Natural Language | Spatial Language Video |

Explore & Download

Productivity Tools

Sciweavers