Sciweavers

CVPR
2007
IEEE

Visual Event Recognition in News Video using Kernel Methods with Multi-Level Temporal Alignment

14 years 5 months ago
Visual Event Recognition in News Video using Kernel Methods with Multi-Level Temporal Alignment
In this work, we systematically study the problem of visual event recognition in unconstrained news video sequences. We adopt the discriminative kernel-based method for which video clip similarity plays an important role. First, we represent a video clip as a bag of orderless descriptors extracted from all of the constituent frames and apply Earth Mover’s Distance (EMD) to integrate similarities among frames from two clips. Observing that a video clip is usually comprised of multiple sub-clips corresponding to event evolution over time, we further build a multilevel temporal pyramid. At each pyramid level, we integrate the information from different sub-clips with Integer-valueconstrained EMD to explicitly align the sub-clips. By fusing the information from the different pyramid levels, we develop Temporally Aligned Pyramid Matching (TAPM) for measuring video similarity. We conduct comprehensive experiments on the Trecvid 2005 corpus, which contains more than 6,800 clips. Our experi...
Dong Xu, Shih-Fu Chang
Added 02 Jun 2010
Updated 02 Jun 2010
Type Conference
Year 2007
Where CVPR
Authors Dong Xu, Shih-Fu Chang
Comments (0)