In this paper, we introduce an elaborate utterance detection algorithm to enhance speaker segmentation. A silence detector, a further divider, and an audio type classifier are employed i...
We present the ZoomSlider, a new interface for skimming and browsing video content in a flexible and interactive way. It circumvents common problems of existing video browsing app...
This paper describes a novel approach to presenting web search results that utilizes the semantic correlations among the retrieved results as well as their spatio-t...
Rahul Singh, Ya-Wen Hsu, Wen-Cheng Sun, Dil Chitau...
In this paper, we propose a semantic routing and filtering framework for large-scale monitoring of video streams. Our goal is to build a distributed system that at any given time ...
In this research, an architecture that performs both forward and inverse lifting-based discrete wavelet transform is proposed. The proposed architecture reduces the hardware requi...
Video management research has largely ignored the growing attractiveness of camera-equipped mobile phones for producing short home video clips, mostly consid...
In this paper, we propose a multi-buffer scheduling scheme for streaming video systems. A transmission rate is obtained via a rate control algorithm, which optimally utilizes the ...
Finding faces in visually challenging environments is crucial to many applications, such as audio-visual automatic speech recognition, video indexing, person recognition, and vide...
Detecting objects of interest from a video sequence is a fundamental and critical task in automated visual surveillance. Those objects can either be moving or stationary. However,...
In this paper, we propose a speech-adaptive layered-coding (LC) scheme for loss concealment of real-time CELP-coded speech transmitted over IP networks. Based on the ITU G.729...