ICPR
2010
IEEE

Lipreading: A Graph Embedding Approach

In this paper, we propose a novel graph embedding method for the problem of lipreading. To characterize the temporal connections among video frames of the same utterance, a new distance metric is defined on a pair of frames and graphs are constructed to represent the video dynamics based on the distances between frames. Audio information is used to assist in calculating such distances. For each utterance, a subspace of the visual feature space is learned from a well-defined intrinsic and penalty graph within a graph-embedding framework. Video dynamics are found to be well preserved along some dimensions of the subspace. Discriminatory cues are then decoded from curves of the projected visual features to classify different utterances.
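The abstract gives no formulas, so the following is only a minimal sketch of the general idea of building a graph over the frames of one utterance and measuring how well a 1-D projection preserves neighbouring frames. Everything in it is an assumption made for illustration: plain Euclidean distance stands in for the paper's audio-assisted distance metric, a k-nearest-neighbour rule stands in for the paper's graph construction, and the function names and toy features are hypothetical. The penalty graph and the eigen-decomposition that a full graph-embedding framework would use are not shown.

```python
import math

def pairwise_distances(frames):
    """Euclidean distance between every pair of frame feature vectors.
    Assumption: a stand-in for the paper's own frame distance."""
    n = len(frames)
    return [[math.dist(frames[i], frames[j]) for j in range(n)] for i in range(n)]

def knn_graph(dist, k):
    """Symmetric 0/1 adjacency matrix linking each frame to its k nearest frames.
    Assumption: a generic rule, not the construction used in the paper."""
    n = len(dist)
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        order = sorted((j for j in range(n) if j != i), key=lambda j: dist[i][j])
        for j in order[:k]:
            w[i][j] = w[j][i] = 1.0
    return w

def laplacian(w):
    """Graph Laplacian L = D - W, where D holds the row sums of W."""
    n = len(w)
    return [[(sum(w[i]) if i == j else 0.0) - w[i][j] for j in range(n)]
            for i in range(n)]

def embedding_cost(w, y):
    """Sum over ordered pairs of w_ij * (y_i - y_j)^2 for a 1-D embedding y."""
    n = len(w)
    return sum(w[i][j] * (y[i] - y[j]) ** 2 for i in range(n) for j in range(n))

def quadratic_form(lap, y):
    """y^T L y, which equals half of embedding_cost for a symmetric W."""
    n = len(lap)
    return sum(y[i] * lap[i][j] * y[j] for i in range(n) for j in range(n))

# Hypothetical 2-D visual features for five consecutive frames.
frames = [(0.0, 0.0), (1.0, 0.1), (2.0, 0.0), (3.0, 0.2), (4.0, 0.1)]
w = knn_graph(pairwise_distances(frames), k=1)
lap = laplacian(w)

# Project every frame onto a candidate direction v.
v = (1.0, 0.0)
y = [sum(a * b for a, b in zip(v, x)) for x in frames]

cost = embedding_cost(w, y)
```

A small `cost` means that frames linked in the graph stay close after projection. In a graph-embedding framework, the projection direction is typically chosen to keep this quantity small for the intrinsic graph while keeping the analogous quantity for the penalty graph large.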
Ziheng Zhou, Guoying Zhao, Matti Pietikäinen
Added 12 Oct 2010
Updated 12 Oct 2010
Type Conference
Year 2010
Where ICPR