In this paper, we present a coordinated video surveillance system that can minimize the spatial limitation and can precisely extract the 3D position of objects. To do this, our system used an agent based system and also tracked the normalized object using active wide-baseline stereo method. The system is composed of two parts: multiple camera agents (CAs) and a support module (SM). Each CA treats image processing and camera controlling. A SM performs a role that manages communication between CAs. Our proposed system extracts object positions independent of environment via the collaboration of CAs and a SM. Finally, through experimental results we show that the proposed system successfully tracks an object on real-time.