Action detection was formulated as a subvolume mutual information maximization problem in [8], where each subvolume identifies where and when the action occurs in the video. Desp...
Network impairments are unpredictable and highly destructive to the perceptual quality of media content in IPTV content distribution networks. As a result, the existing network...
This paper describes multi-user autostereoscopic displays developed within the European Union-funded MUTED and HELIUM3D projects. These utilize head tracking in order to provide i...
Philip Surman, Rajwinder Singh Brar, Ian Sexton, K...
Speech processing is an important aspect of affective computing. Most research in this direction has focused on classifying emotions into a small number of categories. However, nu...
Dongrui Wu, Thomas D. Parsons, Emily Mower, Shrika...
Labeling persons appearing in video frames with names detected from the video transcript helps improve video content identification and search. We develop a face naming...
Phi The Pham, Marie-Francine Moens, Tinne Tuytelaa...
In this demo, we present three lean methods for real-time adaptation of live MPEG-2 video to limited and varying network bandwidth. Our methods use real-time resource management t...
In this paper, we propose an unsupervised segmentation algorithm for extracting moving objects/regions from compressed video using Markov Random Field (MRF) classification. First,...
Audio tags describe different types of musical information such as genre, mood, and instrument. This paper aims to automatically annotate audio clips with tags and retrieve releva...
Traditional 3D audio systems often have a limited sweet spot for the user to perceive 3D effects successfully. In this paper, we present a personal 3D audio system with loudspeake...