Detecting regions of interest in video sequences is the most important task in many high level video processing applications. In this paper a robust technique based on recursive learning of video background and foreground models is presented. The proposed modeling technique achieves a fast convergence speed and an adaptive, accurate background/foreground model. Our contributions can be described along four directions. First, a recursive learning scheme is developed to build the models based on colors of the pixels. Our second contribution is to generate background and foreground models to enforce the temporal consistency of detected foregrounds. Third, we exploit dependencies between pixel colors to insure that the model is not restricted to using only independent features. Finally, an adaptive pixel-wise criterion is proposed that incorporates different spatial situations in the scene. We also enforce spatial consistency of the pixels to rule out the effect of erroneously labeled for...