Sciweavers

551 search results - page 94 / 111
» Multimodal Speech Synthesis
Sort
View
MOBISYS
2005
ACM
16 years 4 months ago
LiveMail: personalized avatars for mobile entertainment
LiveMail is a prototype system that allows mobile subscribers to communicate using personalized 3D face models created from images taken by their phone cameras. The user takes a s...
Miran Mosmondor, Tomislav Kosutic, Igor S. Pandzic
ICASSP
2011
IEEE
14 years 8 months ago
Continuous F0 in the source-excitation generation for HMM-based TTS: Do we need voiced/unvoiced classification?
Most HMM-based TTS systems use a hard voiced/unvoiced classification to produce a discontinuous F0 signal which is used for the generation of the source-excitation. When a mixed ...
Javier Latorre, Mark J. F. Gales, Sabine Buchholz,...
CW
2006
IEEE
15 years 10 months ago
An Interactive Mixed Reality Framework for Virtual Humans
In this paper, we present a simple and robust Mixed Reality (MR) framework that allows for real-time interaction with Virtual Humans in real and virtual environments under consist...
Arjan Egges, George Papagiannakis, Nadia Magnenat-...
IUI
2006
ACM
15 years 10 months ago
Three phase verification for spoken dialog clarification
Spoken dialog tasks incur many errors including speech recognition errors, understanding errors, and even dialog management errors. These errors create a big gap between user'...
Sangkeun Jung, Cheongjae Lee, Gary Geunbae Lee
NOLISP
2005
Springer
15 years 10 months ago
A Simple, Quasi-linear, Discrete Model of Vocal Fold Dynamics
In current speech technology, linear prediction dominates. The linear vocal tract model is well justified biomechanically, and linear prediction is a simple and well understood si...
Max Little, Patrick McSharry, Irene Moroz, Stephen...