In this paper, we propose an approach to collaboratively track motion of a moving target in a wide area utilizing camera-equipped visual sensor networks, which are expected to play an essential role in a variety of applications such as surveillance and monitoring. A genetic fitting method for efficient contour extraction is used as inter-scene approach to detect and track the target. We also considered the existence of faulty sensors in the network which deteriorate the difficulty of target tracking problem, and proposed a robust sensor collaboration method. The experimental results have shown that the proposed target tracking approach produces very successful target tracking compared with the existing method especially in case that the target is adjacent to neighboring objects of background.