Sciweavers

ICML
2010
IEEE

3D Convolutional Neural Networks for Human Action Recognition

14 years 1 months ago
3D Convolutional Neural Networks for Human Action Recognition
We consider the fully automated recognition of actions in uncontrolled environment. Most existing work relies on domain knowledge to construct complex handcrafted features from inputs. In addition, the environments are usually assumed to be controlled. Convolutional neural networks (CNNs) are a type of deep models that can act directly on the raw inputs, thus automating the process of feature construction. However, such models are currently limited to handle 2D inputs. In this paper, we develop a novel 3D CNN model for action recognition. This model extracts features from both spatial and temporal dimensions by performing 3D convolutions, thereby capturing the motion information encoded in multiple adjacent frames. The developed model generates multiple channels of information from the input frames, and the final feature representation is obtained by combining information from all channels. We apply the developed model to recognize human actions in real-world environment, and it achie...
Shuiwang Ji, Wei Xu, Ming Yang, Kai Yu
Added 09 Nov 2010
Updated 09 Nov 2010
Type Conference
Year 2010
Where ICML
Authors Shuiwang Ji, Wei Xu, Ming Yang, Kai Yu
Comments (0)