Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum