In this paper, camera motion detection methods using a background image generated by video mosaicing based on the correlation between feature points on a frame pair are described....
The rate-distortion optimal mode decision as well as motion estimation adopted in H.264 brings a big challenge to realtime encoding and transcoding duo to the high computation com...
Yi Wang, Xiaoyan Sun, Feng Wu, Shipeng Li, Houqian...
Blind multiplicative watermarking schemes for speech signals using wavelets and discrete cosine transform are presented. Watermarked signals are modeled using a generalized Gaussi...
Multimodal speech and speaker modeling and recognition are widely accepted as vital aspects of state of the art human-machine interaction systems. While correlations between speec...
Mehmet Emre Sargin, Oya Aran, Alexey Karpov, Ferda...
A novel video encoding and splicing method is proposed which minimizes the tune-in time of “channel zapping”, i.e. changing from one audiovisual service to another, in IPDC ov...
In this paper, we present a speaker identification algorithm for a microphone array based on a first-order joint Hidden Markov Model (HMM) where the observations correspond to t...
In this paper, we propose the seamless video transmission scheme over wireless LANs during fast handover in Mobile IPv6. In fact, a mobile node cannot receive the IP packets durin...
This paper investigates postfiltering for residual echo suppression in networks employing low-bit-rate speech compression in the echo path. Simulations show that the residual echo...
In this work, we cope with the problem of identifying the number of repetitions of a specific video clip in a target video clip. Generally, the methods that deal with this proble...
The MISP Processor is a programmable media processor which supports multi-issuing, multi-threading and stream processing techniques. MISP executes applications that have been mapp...