Speaker diarization is originally defined as the task of determining “who spoke when” given an audio track and no other prior knowledge of any kind. The following article sho...
We present a system for the interactive navigation through high-resolution cylindrical panoramas. The system is based on MPEG-4 and describes the virtual world by the scene descri...
In this paper, we specifically propose the Weber-Fechner Law-based human attention model for semantic scene analysis in movies. Different from traditional video processing techniq...
Anan Liu, Yongdong Zhang, Yan Song, Dongming Zhang...
The use of video and audio features for automated annotation of audio-visual data is becoming widespread. A major limitation of many of the current methods is that the stored inde...
Kieron Messer, Josef Kittler, Barbara Levienaise-O...
This paper proposes a new approach for video stabilization.
Most existing video stabilization methods adopt
a framework of three steps, motion estimation, motion compensation
an...
Ken-Yi Lee, Yung-Yu, Chuang Bing-Yu, Chen Ming Ouh...