Most methods for multiple object tracking in video represent the state of multi-objects in a high dimensional joint state space. This leads to high computational complexity. This paper presents a method using the probability hypothesis density (PHD) filter to estimate the state of multiple objects in video. The method operates on the single object state space instead of the joint state space. A PHD recursion for visual observations with color measurements is proposed. Our method can track varying number of objects.