—Network cameras, extensively used in video surveillance, often allow pan-tilt-zoom functionality and are also subject to wind load and mount vibrations, thus causing video frame misalignment. Although algorithms for motion detection, a basic block of most visual surveillance systems, are relatively mature for fixed cameras, they usually perform poorly for active and/or vibrating cameras. The issue is particularly severe for algorithms using multiple video frames jointly. In this paper, we extend our earlier work on multipleframe motion detection to the case of active and unstable cameras. Our method accounts for spatially-affine, inter-frame transformations that can vary in time, uses a variational formulation and applies a level-set solution. We present groundtruth and real-data experimental results and show significant improvements over earlier methods. Keywords-motion detection, multi-frame video analysis, level sets, affine motion