This paper describes a new representation for the audio and visual information in a video signal. We reduce the dimensionality of the signals with singular-value decompositions (S...
Mismatch between training and testing data is a major error source for both Automatic Speech Recognition (ASR) and Automatic Speaker Identification (ASI). In this paper, we first ...
Xi Zhou, Yun Fu, Ming Liu, Mark Hasegawa-Johnson, ...