We examine the utility of speech and lexical features for predicting student emotions in computerhuman spoken tutoring dialogues. We first annotate student turns for negative, neu...
We are interested in recovering aspects of vocal tract’s geometry and dynamics from auditory and visual speech cues. We approach the problem in a statistical framework based on ...
Athanassios Katsamanis, George Papandreou, Petros ...
This paper proposes a speech comprehension computational model based on neurocognitiveresearches. The computational representation uses techniques as wavelets transform and connec...
Enriching a pronunciation dictionary with phonological variation is a challenging task, not yet solved despite several decades of research, in particular for speech-to-text transc...
Abstract. Gender and age estimation based on Gaussian Mixture Models (GMM) is introduced. Telephone recordings from the Czech SpeechDatEast database are used as training and test d...