This paper describes an approach to video structuring and indexing. It relies on motion wavelet coefficients estimated directly from the image sequence. These coefficients provide a multiscale characterization of the optical flow and allow us to define dominant and local motion descriptors, related to camera and object displacements respectively. We use the dominant motion descriptors to perform a temporal segmentation of the sequence into shots. The extracted shots are characterized in terms of their dominant motion properties and indexed using descriptors related to their local motion content. These operations make it possible to retrieve shots through query by example according to the dynamic content of the scene alone, independently of camera displacements.
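
As a rough illustration of the pipeline outlined above (segment on dominant motion, index on local motion, retrieve by example), the following minimal sketch may help. It assumes that per-frame dominant and local motion descriptors are already available as fixed-length numeric vectors and that the sequence is non-empty. The function names, the distance threshold, and the use of a mean descriptor per shot are all hypothetical choices made for this example; they are a simple stand-in and not the method developed in the paper.

```python
from math import dist  # Euclidean distance, available since Python 3.8


def segment_shots(dominant, threshold):
    """Cut the sequence wherever consecutive dominant-motion descriptors
    differ by more than `threshold` (assumed, illustrative criterion)."""
    shots, start = [], 0
    for t in range(1, len(dominant)):
        if dist(dominant[t - 1], dominant[t]) > threshold:
            shots.append((start, t))  # half-open frame interval [start, t)
            start = t
    shots.append((start, len(dominant)))
    return shots


def index_shots(shots, local):
    """Describe each shot by the mean of its per-frame local-motion
    descriptors (assumed aggregation, chosen for simplicity)."""
    index = []
    for start, end in shots:
        frames = local[start:end]
        index.append([sum(col) / len(frames) for col in zip(*frames)])
    return index


def retrieve(index, query):
    """Rank shot ids by distance between their local-motion descriptor
    and that of the query example; camera motion plays no role here."""
    return sorted(range(len(index)), key=lambda i: dist(index[i], query))


# Toy data: four frames, a jump in dominant (camera) motion after frame 1.
dominant = [(0.0, 0.0), (0.1, 0.0), (5.0, 0.0), (5.1, 0.0)]
local = [(1.0,), (1.0,), (0.0,), (0.2,)]

shots = segment_shots(dominant, threshold=1.0)
index = index_shots(shots, local)
ranking = retrieve(index, query=index[0])  # query with the first shot itself
```

Because retrieval in this sketch compares only the local-motion descriptors, two shots with different camera motion but similar object motion would still rank close to each other, which is the behaviour the abstract describes.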