This paper presents an approach to estimating the parameters of continuous density HMMs for visual speech recognition. One of the key issues of image-based visual speech recogniti...
This paper describes a new motion-based feature extraction technique for speaker identification using orientation estimation in 2D manifolds. The motion is estimated by computing ...
Automatic image tagging is important yet challenging due to the semantic gap and the lack of learning examples to model a tag’s visual diversity. Meanwhile, social user tagging ...
This paper considers a method for speech emotion recognition using a max-margin framework that incorporates a loss function based on a well-known model called the Watson and Tellegen’s...
Query-by-tapping systems are content-based music retrieval systems that allow users to tap or clap the rhythmic pattern of the requested melody into a microphone. In this paper, a n...