We propose a correlogram-based time delay estimation method using signals modeled as the output of the cochlea, where the low-level signal processing happens in the human auditory system. With a normalized correlogram that preserves time-delay patterns that are invariant to speech features such as formants, we employ two-dimensional template matching for time-delay estimation. Experimental results show that our method outperforms a traditional correlogram-based method as well as the GCC-PHAT, especially for short analysis windows in a moderately reverberant environment.
Bowon Lee, Ton Kalker, Ronald W. Schafer