Visual speech synthesis by modelling coarticulation dynamics using a non-parametric switching state-space model

We present a novel approach to speech-driven facial animation using a non-parametric switching state-space model based on Gaussian processes. The model is an extension of the shared Gaussian process dynamical model, augmented with switching states. Audio and visual data from a talking head corpus are jointly modelled using the proposed method. The switching states are found using variable length Markov models trained on labelled phonetic data. We also propose a synthesis technique that takes into account both previous and future phonetic context, thus accounting for coarticulatory effects in speech.

Categories and Subject Descriptors: I.5.4 [Image Processing and Computer Vision]: Applications--Computer vision, Signal processing
Keywords: speech-driven facial animation, visual speech synthesis, artificial talking head
General Terms: algorithms, theory, experimentation
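To make the switching idea concrete, below is a minimal, hypothetical NumPy sketch: it trains one standard GP regressor per phonetic "switching state", each mapping audio features to visual parameters, and routes each frame to its state's GP at synthesis time. All names, the toy data, and the plain GP regression are illustrative assumptions; the paper's actual model is a shared Gaussian process dynamical model with switching states and context-aware synthesis, which is substantially richer than this sketch.

import numpy as np

def rbf_kernel(A, B, lengthscale=1.0, variance=1.0):
    # Squared-exponential kernel between the row vectors of A and B.
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return variance * np.exp(-0.5 * d2 / lengthscale ** 2)

class StateGP:
    # Plain GP regression for one switching state (one phone class);
    # stands in for the paper's far richer shared GPDM component.
    def __init__(self, X, Y, noise=1e-2):
        self.X = X
        K = rbf_kernel(X, X) + noise * np.eye(len(X))
        self.alpha = np.linalg.solve(K, Y)  # K^{-1} Y, reused at test time

    def predict(self, Xs):
        # Posterior mean at the test inputs Xs.
        return rbf_kernel(Xs, self.X) @ self.alpha

rng = np.random.default_rng(0)
# Synthetic stand-in corpus: 2-D audio features, 3-D visual parameters,
# and a phone label per frame acting as the switching state.
audio = rng.normal(size=(200, 2))
visual = np.tanh(audio @ rng.normal(size=(2, 3)))
phones = rng.integers(0, 2, size=200)

# Train one GP per switching state on that state's frames only.
models = {s: StateGP(audio[phones == s], visual[phones == s]) for s in (0, 1)}

# Synthesis: route each incoming audio frame to its state's GP.
test_audio = rng.normal(size=(5, 2))
test_phones = [0, 0, 1, 1, 0]
frames = np.vstack([models[s].predict(a[None])
                    for a, s in zip(test_audio, test_phones)])
print(frames.shape)  # (5, 3) predicted visual parameter frames

The point of the partition is that each state's GP only has to model the audio-visual mapping within one phonetic regime; in the paper, the state sequence itself is predicted with variable length Markov models and the synthesis additionally conditions on past and future phonetic context to capture coarticulation.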
Type Conference
Year 2010
Where ICMI (International Conference on Multimodal Interfaces)
Authors Salil Deena, Shaobo Hou, Aphrodite Galata