Efficient methods of content characterization for the browsing, retrieval or filtering of vast amount of digital video content has become a necessity. Still, there is a gap between the computationally available measures of content characteristics and the semantic interpretations of these characteristics. We want to establish connections between motion activity characteristics of video segments and the semantic characterization of them. For this purpose, two simple descriptors for motion activity of a video content is used to infer high-level semantic features of video in certain contexts. One of these descriptors, monotonous activity, is defined as the average block-based motion vector magnitude. The second descriptor, nonmonotonous activity, is an approximation to the average temporal derivative of motion vectors. Simulation results for browsing and retrieval applications show that by using the two measures together, object motions that occur close to the camera can be distinguished ...
Kadir A. Peker, A. Aydin Alatan, Ali N. Akansu