Large Vocabulary Audio-Visual Speech Recognition Using Active Shape Models

16 years 7 months ago

Download www.research.ibm.com

Orthogonal information present in the video signal associated with the audio helps in improving the accuracy of a speech recognition system. Audio-visual speech recognition involves extraction of both the audio as well as visual features from the input signal. Extraction of visual parameters is done by the recognition of speech dependent features from the video sequence. This paper uses geometrical features to describe the lip shapes. Curve-based Active Shape Models are used to extract the geometry. These geometrically represented visual parameters are used along with the audio cepstral features to perform an audio-visual classification. It is shown that the bimodal system presented here gives an improvement in the classification results over classification using only the audio features.

Tanveer A. Faruquie, Abhik Majumdar, Nitendra Rajp

Real-time Traffic

Audio Cepstral Features | Audio-Visual Speech Recognition | Computer Vision | ICPR 2000 | Speech Dependent Features |

claim paper

» Techniques to Achieve an Accurate RealTime LargeVocabulary Speech Recognition System

» HighAccuracy LargeVocabulary Speech Recognition Using Mixture Tying and Consistency Modeli...

» A study on multilingual acoustic modeling for large vocabulary ASR

» Multilingual Speech Databases at LDC

» Large vocabulary continuous speech recognition with contextdependent DBNHMMS

» A One Pass Decoder Design For Large Vocabulary Recognition

» Multiview and multiobjective semisupervised learning for large vocabulary continuous speec...

» Generating compound words with high order ngram information in large vocabulary speech rec...

Post Info
More Details (n/a)

Added	09 Nov 2009
Updated	09 Nov 2009
Type	Conference
Year	2000
Where	ICPR
Authors	Tanveer A. Faruquie, Abhik Majumdar, Nitendra Rajput, L. Venkata Subramaniam

Comments (0)

Sciweavers

Large Vocabulary Audio-Visual Speech Recognition Using Active Shape Models

Audio Cepstral Features | Audio-Visual Speech Recognition | Computer Vision | ICPR 2000 | Speech Dependent Features |

Explore & Download

Productivity Tools

Sciweavers