— This paper investigates spatiotemporal feature extraction from temporal image sequences based on invariance representation. Invariance representation is one of important functions of the visual cortex. We propose a novel hierarchical model based on invariance and independent component analysis for spatiotemporal feature extraction. Training the model from patches sampled from natural scenes, we can obtain image basis with properties of translational, scaling, and rotational features. Further experiments on TV videos and facial image sequences show different characteristics of spatiotemporal features are achieved by training the proposed model. All these computer simulations verify that our proposed model is successful for spatiotemporal feature extraction.