Automatic lipreading is automatic speech recognition that uses only visual information. The relevant data in the video signal are isolated and features are extracted from them. From the resulting sequence of feature vectors, where each vector represents one video image, a sequence of higher-level semantic elements is formed. These semantic elements are “visemes”, the visual equivalent of “phonemes”. The developed prototype uses a Time-Delay Neural Network (TDNN) to classify the visemes.
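The classification step described above can be sketched as follows. This is a minimal illustrative example, not the prototype's actual implementation: a TDNN slides a fixed delay window over the per-frame feature vectors and scores each window position against every viseme class. All sizes (feature dimension, window length, number of viseme classes) and the randomly initialised weights are hypothetical stand-ins for a trained network.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: 20 visual features per video frame,
# a delay window of 5 frames, and 12 viseme classes.
N_FEATURES, N_DELAYS, N_VISEMES = 20, 5, 12

# Randomly initialised weights stand in for a trained network.
W = rng.normal(scale=0.1, size=(N_VISEMES, N_DELAYS, N_FEATURES))
b = np.zeros(N_VISEMES)

def tdnn_scores(frames):
    """Slide the delay window over the frame sequence and score each position.

    frames: (T, N_FEATURES) array, one feature vector per video image.
    Returns a (T - N_DELAYS + 1, N_VISEMES) array of class scores.
    """
    T = frames.shape[0]
    out = np.empty((T - N_DELAYS + 1, N_VISEMES))
    for t in range(T - N_DELAYS + 1):
        window = frames[t:t + N_DELAYS]  # (N_DELAYS, N_FEATURES)
        # Contract the window with each class's weight tensor.
        out[t] = np.tensordot(W, window, axes=([1, 2], [0, 1])) + b
    return out

# Example: a 30-frame clip of random features.
clip = rng.normal(size=(30, N_FEATURES))
scores = tdnn_scores(clip)
predicted = scores.argmax(axis=1)  # one viseme label per window position
print(scores.shape)  # (26, 12)
```

Because the same weights are applied at every temporal position, the network's output is invariant to where in the clip a viseme occurs, which is the defining property of the TDNN architecture.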