Despite their simplicity, scalar threshold operators effectively remove additive white Gaussian noise from wavelet detail coefficients of many practical signals. This paper explor...
Alyson K. Fletcher, Vivek K. Goyal, Kannan Ramchan...
In this paper, we investigate the use of coupled hidden Markov models (CHMMs) for the task of audio-visual text-dependent speaker identification. Our system determines the iden...
Tieyan Fu, Xiao Xing Liu, Lu Hong Liang, Xiaobo Pi...
We present a probabilistic method for audio-visual (AV) speaker tracking, using an uncalibrated wide-angle camera and a microphone array. The algorithm fuses 2-D object shape and ...
Daniel Gatica-Perez, Guillaume Lathoud, Iain McCow...
This paper presents an information-theoretic study of video codecs that are based on the principle of source coding with side information at the decoder. In contrast to the classical ...
Prakash Ishwar, Vinod M. Prabhakaran, Kannan Ramch...
We extend a recently proposed framework for the rate-distortion optimized transmission of packetized media. The original framework assumed that media packets each have a single arr...