In this paper, we introduce a system that synthesizes the emotional audio-visual speech for a 3-D talking agent by adopting the PAD (Pleasure-Arousal-Dominance) emotional model. A ...
■ Audiovisual speech perception provides an opportunity to investigate the mechanisms underlying multimodal processing. By using nonspeech stimuli, it is possible to investigate...
Marco Loh, Gabriele Schmid, Gustavo Deco, Wolfram ...
Music information processing has become very important due to the ever-growing amount of music data from emerging applications. In this demonstration, we present a novel approach ...
In an attempt to improve models of human perception, the recognition of phonemes in nonsense utterances was predicted with automatic speech recognition (ASR) in order to analyze i...
Automatic audio classification usually considers sounds as music, speech, silence or noise, but works about the noise class are rare. Audio features are generally specific to sp...
Pierre Hanna, Nicolas Louis, Myriam Desainte-Cathe...