This paper presents a new scheme for acquiring 3D kinematic structure and motion from time-series volume data, in particular, focusing on human body. Our basic strategy is to first represent the shape structure of the target in each frame by using aMRG, augmented Multiresolution Reeb Graph [6], and then deform each of the shape structures so that all of them can be identified as a common kinematic structure throughout the input frames. Although the shape structures can be very different from frame to frame, we propose to derive a unique kinematic structure by way of clustering some nodes of graph, based on the fact that they are partly coherent. The only assumption we make is that human body can be approximated by an articulated body with certain number of end-points and branches. We demonstrate the efficacy of the proposed scheme through some experiments.