Normalized Training for HMM-Based Visual Speech Recognition

16 years 8 months ago

Download www.sp.nitech.ac.jp

This paper presents an approach to estimating the parameters of continuous density HMMs for visual speech recognition. One of the key issues of image-based visual speech recognition is normalization of lip location and lighting condition prior to estimating the parameters of HMMs. We presented a normalized training method in which the normalization process is integrated in the model training. This paper extends it for contrast normalization in addition to average-intensity and location normalization. The proposed method provides a theoretically-well-defined algorithm based on a maximum likelihood formulation, hence the likelihood for the training data is guaranteed to increase at each iteration of the normalized training. Experiments on M2VTS database show that the recognition performance can be significantly improved by the normalized training.

Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamur

Real-time Traffic

Contrast Normalization | ICIP 2000 | Image Processing | Normalized Training Method | Visual Speech Recognition |

claim paper

» Onedimensional representation of twodimensional information for HMM based handwriting reco...

» Acoustic model training for nonaudible murmur recognition using transformed normal speech ...

» Irrelevant variability normalization based HMM training using map estimation of feature tr...

» Implicit Trajectory Modeling through Gaussian Transition Models for Speech Recognition

» Reservoirbased techniques for speech recognition

» Gesturebased Dynamic Bayesian Network for noise robust speech recognition

» A study of an irrelevant variability normalization based discriminative training approach ...

» Speech Recognition Model for Tamil Stops

Post Info
More Details (n/a)

Added	25 Oct 2009
Updated	25 Oct 2009
Type	Conference
Year	2000
Where	ICIP
Authors	Yoshihiko Nankaku, Keiichi Tokuda, Tadashi Kitamura, Takao Kobayashi

Comments (0)

Sciweavers

Normalized Training for HMM-Based Visual Speech Recognition

Contrast Normalization | ICIP 2000 | Image Processing | Normalized Training Method | Visual Speech Recognition |

Explore & Download

Productivity Tools

Sciweavers