Video surveillance systems are only as good as their ability to capture events of interest. Automatic detection of acoustic events of interest, coupled with steerable cameras, greatly increases the ability of surveillance cameras to capture relevant data. This paper describes an approach for retrofitting this capability onto existing surveillance camera networks.