This paper proposes two DSs to describe the visual information of an AV document. The first one, is devoted to still images. It describes the image visual appearance and its structure with regions as well as its semantic content in terms of objects. The second DS is devoted to video sequences. It describes the sequence structure as well as its semantic content in terms of events. Features such as motion, camera activity, etc. are included in this DS. Moreover, it involves static visual representations such as key-frames, background mosaics and keyregions. These elements are considered as still images and are described by the first DS.
Philippe Salembier, Noel E. O'Connor, Paulo Correi