Multimodal information fusion using the iterative decoding algorithm and its application to audio-visual speech recognition

14 years 9 months ago

Download www.itr-rescue.org

The fusion of information from heterogenous sensors is crucial to the effectiveness of a multimodal system. Noise affect the sensors of different modalities independently. A good fusion scheme should be able to use local estimates of the reliability of each modality to weight the decisions. This paper presents an iterative decoding based information fusion scheme motivated by the theory of turbo codes. This fusion framework is developed in the context of hidden Markov models. We present the mathematical framework of the fusion scheme. We then apply this algorithm to an audio-visual speech recognition task on the GRID audio-visual speech corpus and present the results.

Shankar T. Shivappa, Bhaskar D. Rao, Mohan M. Triv

Real-time Traffic

Audio-visual Speech | Fusion Scheme | ICASSP 2008 | Information Fusion Scheme | Signal Processing |

claim paper

Post Info
More Details (n/a)

Added	30 May 2010
Updated	30 May 2010
Type	Conference
Year	2008
Where	ICASSP
Authors	Shankar T. Shivappa, Bhaskar D. Rao, Mohan M. Trivedi

Comments (0)

Sciweavers

Multimodal information fusion using the iterative decoding algorithm and its application to audio-visual speech recognition

Audio-visual Speech | Fusion Scheme | ICASSP 2008 | Information Fusion Scheme | Signal Processing |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers