Sciweavers

123 search results - page 23 / 25
» Improving Acoustic Models with Captioned Multimedia Speech
Sort
View
ICNC
2010
Springer
13 years 5 months ago
Emotional talking agent: System and evaluation
In this paper, we introduce a system that synthesizes the emotional audio-visual speech for a 3-D talking agent by adopting the PAD (Pleasure-Arousal-Dominance) emotional model. A ...
Shen Zhang, Jia Jia, Yingjin Xu, Lianhong Cai
MM
2009
ACM
221views Multimedia» more  MM 2009»
14 years 2 months ago
Using large-scale web data to facilitate textual query based retrieval of consumer photos
The rapid popularization of digital cameras and mobile phone cameras has lead to an explosive growth of consumer photo collections. In this paper, we present a (quasi) real-time t...
Yiming Liu, Dong Xu, Ivor W. Tsang, Jiebo Luo
IEEEMSP
2002
IEEE
117views Multimedia» more  IEEEMSP 2002»
14 years 16 days ago
Hidden Markov model for automatic transcription of MIDI signals
— This paper describes a Hidden Markov Model (HMM)-based method of automatic transcription of MIDI (Musical Instrument Digital Interface) signals of performed music. The problem ...
Haruto Takeda, Naoki Saito, Tomoshi Otsuki, Mitsur...
CLEAR
2007
Springer
136views Biometrics» more  CLEAR 2007»
14 years 1 months ago
The ISL RT-07 Speech-to-Text System
Abstract. This paper describes the 2007 meeting speech-to-text system for lecture rooms developed at the Interactive Systems Laboratories (ISL), for the multiple distant microphone...
Matthias Wölfel, Sebastian Stüker, Flori...
COST
2009
Springer
203views Multimedia» more  COST 2009»
14 years 2 months ago
Multiple Feature Extraction and Hierarchical Classifiers for Emotions Recognition
Abstract. The recognition of the emotional states of speaker is a multidisciplinary research area that has received great interest in the last years. One of the most important goal...
Enrique M. Albornoz, Diego H. Milone, Hugo Leonard...