This paper proposes an optimized and efficient technique for keyframe extraction from video sequences, which leads to the selection of a meaningful set of video frames for each g...
We describe a novel technique for multi-sensory speech processing for enhancing noisy speech and for improved noise-robust speech recognition. Both air- and bone-conductive microph...
Amarnag Subramanya, Li Deng, Zicheng Liu, Zhengyou...
Music visualization provides users with a new interface to browse, search, and navigate their personal digital music collection. Although there are several previous works on visua...
This paper presents an automatic video editing system based on head tracking for archiving meetings. Systems that archive meetings are attracting considerable interest. Convention...
Finding faces in visually challenging environments is crucial to many applications, such as audio-visual automatic speech recognition, video indexing, person recognition, and vide...