This paper presents a framework to simultaneously segment and track multiple body parts of interacting humans in the presence of mutual occlusion and shadow. The framework uses multiple free-form blobs and a coarse model of the human body. The color image sequence is processed at three levels: pixel level, blob level, and object level. A Gaussian mixture model is used at the pixel level to train and classify individual pixel based on color. Relaxation labeling in an attribute relational graph (ARG) is used at the blob level to merge the pixels into coherent blobs and to represent inter-blob relations. A twofold tracking scheme is used that consists of blob-to-blob matching in consecutive frames and blob-to-body-part association within a frame. The tracking scheme resembles multi-target, multiassociation tracking (MMT). A coarse model of the human body is applied at the object level as empirical domain knowledge to resolve ambiguity due to occlusion and to recover from intermittent tra...
Sangho Park, Jake K. Aggarwal