Video Google: Efficient Visual Search of Videos


We describe an approach to object retrieval which searches for and localizes all the occurrences of an object in a video, given a query image of the object. The object is represented by a set of viewpoint invariant region descriptors so that recognition can proceed successfully despite changes in viewpoint, illumination and partial occlusion. The temporal continuity of the video within a shot is used to track the regions in order to reject those that are unstable. Efficient retrieval is achieved by employing methods from statistical text retrieval, including inverted file systems, and text and document frequency weightings. This requires a visual analogy of a word which is provided here by vector quantizing the region descriptors. The final ranking also depends on the spatial layout of the regions. The result is that retrieval is immediate, returning a ranked list of shots in the manner of Google. We report results for object retrieval on the full-length feature films 'Groundhog Day'...
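The text-retrieval analogy in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the function names (`quantize`, `build_index`, `search`), the toy 2-D descriptors, and the hand-picked three-word vocabulary are all assumptions made for illustration. In practice the vocabulary would typically be learned by clustering real region descriptors, and the region tracking and spatial-layout re-ranking mentioned above are omitted here.

```python
import math
from collections import Counter, defaultdict


def quantize(descriptor, vocabulary):
    """Map a descriptor to its nearest vocabulary centre (its 'visual word')."""
    return min(
        range(len(vocabulary)),
        key=lambda i: sum((d - c) ** 2 for d, c in zip(descriptor, vocabulary[i])),
    )


def tfidf(counts, idf):
    """Turn visual-word counts into an L2-normalised tf-idf vector (as a dict)."""
    total = sum(counts.values())
    vec = {w: (c / total) * idf.get(w, 0.0) for w, c in counts.items()}
    norm = math.sqrt(sum(v * v for v in vec.values()))
    return {w: v / norm for w, v in vec.items()} if norm > 0 else vec


def build_index(frames, vocabulary):
    """frames maps a frame id to its list of descriptors (hypothetical layout)."""
    counts = {
        fid: Counter(quantize(d, vocabulary) for d in descs)
        for fid, descs in frames.items()
    }
    # Inverted file: visual word -> set of frames that contain it.
    inverted = defaultdict(set)
    for fid, c in counts.items():
        for w in c:
            inverted[w].add(fid)
    # Inverse document frequency: rarer words get larger weights.
    idf = {w: math.log(len(frames) / len(fids)) for w, fids in inverted.items()}
    vectors = {fid: tfidf(c, idf) for fid, c in counts.items()}
    return inverted, idf, vectors


def search(query_descs, vocabulary, inverted, idf, vectors):
    """Rank frames sharing at least one visual word with the query, by cosine score."""
    q = tfidf(Counter(quantize(d, vocabulary) for d in query_descs), idf)
    candidates = set()
    for w in q:
        candidates |= inverted.get(w, set())
    scores = {
        fid: sum(q[w] * vectors[fid].get(w, 0.0) for w in q) for fid in candidates
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy data: three visual words and three frames (all values are made up).
vocabulary = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
frames = {
    "a": [(0.1, 0.2), (9.8, 0.1)],
    "b": [(0.3, 9.9), (0.2, 10.1)],
    "c": [(10.2, 0.3), (9.9, -0.1), (0.1, 9.7)],
}
inverted, idf, vectors = build_index(frames, vocabulary)
ranked = search([(0.0, 0.1), (10.1, 0.0)], vocabulary, inverted, idf, vectors)
```

Because the inverted file is consulted first, only frames that share a visual word with the query are scored, which is what makes this style of lookup fast on large collections. In the toy data, frame "b" is never scored because it has no word in common with the query.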

Josef Sivic, Andrew Zisserman

CLOR 2006 | Computer Vision | Invariant Region Descriptors | Object Retrieval | Region Descriptors

Post Info

Added	20 Aug 2010
Updated	20 Aug 2010
Type	Conference
Year	2006
Where	CLOR
Authors	Josef Sivic, Andrew Zisserman
