Audio-Visual Speech Recognition (AVSR) uses vision to enhance speech recognition, but it also introduces the problem of how to join, or fuse, the two signals. Mainstream research achieves this fusion using a weighted product of the outputs of the phoneme classifiers for the two modalities. This paper analyses current weighting measures and compares them with several new measures proposed by the authors. Most importantly, in calculating the dispersion of the classifier output, the analysis shifts from the variance of the distribution to its skewness. Experiments in AVSR using neural networks raise questions about the utility of such measures, with some intriguing results.
Trent W. Lewis, David M. W. Powers
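The abstract does not spell out the fusion rule or the weighting measures themselves, so the following is a minimal sketch, assuming a standard weighted-product combination of per-modality phoneme posteriors and illustrative dispersion-based stream weights; the function names `fuse_weighted_product` and `dispersion_weight` and the exact normalisation of the weight are hypothetical, not the authors' method.

```python
import numpy as np
from scipy.stats import skew

def fuse_weighted_product(p_audio, p_video, lam):
    """Weighted-product fusion of two phoneme posterior vectors.

    p_audio, p_video: 1-D arrays of classifier outputs over the phoneme set.
    lam: audio stream weight in [0, 1]; the video stream gets (1 - lam).
    """
    fused = (p_audio ** lam) * (p_video ** (1.0 - lam))
    return fused / fused.sum()  # renormalise so the result is a distribution

def dispersion_weight(p_audio, p_video, measure="variance"):
    """Derive a stream weight from the dispersion of each output vector.

    The idea: a peaked (confident) classifier output has more dispersed
    values, so that modality earns a larger share of the fusion weight.
    measure: "variance" for the classical choice, "skewness" for the
    alternative the abstract highlights.
    """
    if measure == "variance":
        s_a, s_v = np.var(p_audio), np.var(p_video)
    elif measure == "skewness":
        # Skewness can be negative, so the resulting weight is clipped below.
        s_a, s_v = skew(p_audio), skew(p_video)
    else:
        raise ValueError(f"unknown measure: {measure}")
    lam = s_a / (s_a + s_v + 1e-12)  # illustrative normalisation only
    return float(np.clip(lam, 0.0, 1.0))

# Example: a confident audio posterior against a flatter video posterior.
p_a = np.array([0.85, 0.05, 0.05, 0.05])
p_v = np.array([0.40, 0.30, 0.20, 0.10])
lam = dispersion_weight(p_a, p_v, measure="skewness")
print(lam, fuse_weighted_product(p_a, p_v, lam))
```

In this sketch the more sharply peaked audio output receives the larger weight; swapping `measure` between "variance" and "skewness" shows how the two dispersion statistics can rank the streams differently on the same outputs.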