Speech-Video Synchronization Using Lips Movements and Speech Envelope Correlation

15 years 23 days ago

Download www.csse.uwa.edu.au

In this paper, we propose a novel correlation based method for speech-video synchronization (synch) and relationship classiﬁcation. The method uses the envelope of the speech signal and data extracted from the lips movement. Firstly, a nonlinear-time-varying model is considered to represent the speech signal as a sum of amplitude and frequency modulated (AM-FM) signals. Each AM-FM signal, in this sum, is considered to model a single speech formant frequency. Using Taylor series expansion, the model is formulated in a way which characterizes the relation between the speech amplitude and the instantaneous frequency of each AM-FM signal w.r.t lips movements. Secondly, the envelope of the speech signal is estimated and then correlated with signals generated from lips movement. From the resultant correlation, the relation between the two signals is classiﬁed and the delay between them is estimated. The proposed method is applied to real cases and the results show that it is able to (i) ...

Amar A. El-Sallam, Ajmal S. Mian

Real-time Traffic

AM-FM Signal | ICIAR 2009 | Image | Lips Movement | Speech Signals |

claim paper

Added	26 May 2010
Updated	26 May 2010
Type	Conference
Year	2009
Where	ICIAR
Authors	Amar A. El-Sallam, Ajmal S. Mian

Sciweavers

Speech-Video Synchronization Using Lips Movements and Speech Envelope Correlation

AM-FM Signal | ICIAR 2009 | Image | Lips Movement | Speech Signals |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers