Sciweavers

584 search results - page 86 / 117
» Physical Audio Signal Processing
Sort
View
TSD
2004
Springer
14 years 3 months ago
A New Multi-modal Database for Developing Speech Recognition Systems for an Assistive Technology Application
In this paper we report on the acquisition and content of a new database intended for developing audio-visual speech recognition systems. This database supports a speaker dependen...
António Moura, Diamantino Freitas, Vitor Pe...
TSD
2001
Springer
14 years 2 months ago
Augmented Auditory Representation of e-Texts for Text-to-Speech Systems
Abstract. Emerging electronic text formats include hierarchical structure and visualization related information that current Text-to-Speech (TtS) systems ignore. In this paper we p...
Gerasimos Xydas, Georgios Kouroupetroglou
ICASSP
2010
IEEE
13 years 10 months ago
Multi-modal analysis of dance performances for music-driven choreography synthesis
We propose a framework for modeling, analysis, annotation and synthesis of multi-modal dance performances. We analyze correlations between music features and dance figure labels ...
Ferda Ofli, Engin Erzin, Yucel Yemez, A. Murat Tek...
ICASSP
2010
IEEE
13 years 10 months ago
Characterization of movie genre based on music score
While it is clear that the full emotional effect of a movie scene is carried through the successful interpretation of audio and visual information, music still carries a significa...
Aida Austin, Elliot Moore II, Udit Gupta, Parag Ch...
ICASSP
2010
IEEE
13 years 8 months ago
Speech/Non-Speech Detection in Meetings from Automatically Extracted low Resolution Visual Features
In this paper we address the problem of estimating who is speaking from automatically extracted low resolution visual cues in group meetings. Traditionally, the task of speech/non...
Hayley Hung, Sileye O. Ba