Speaker independent visual-only language identification

We describe experiments in visual-only language identification (VLID), in which only lip shape, appearance and motion are used to determine the language of a spoken utterance. In previous work, we had shown that this is possible in speaker-dependent mode, i.e. identifying the language spoken by a multi-lingual speaker. Here, by appropriately modifying techniques that have been successful in audio language identification, we extend the work to discriminating two languages in speaker-independent mode. Our results indicate that even with viseme accuracy as low as about 34%, reasonable discrimination can be obtained. A simulation of degraded accuracy viseme recognition performance indicates that high VLID accuracy should be achievable with viseme recognition errors of the order of 50%.

Jacob L. Newman, Stephen J. Cox

Audio Language Identification | ICASSP 2010 | Signal Processing | Spoken Utterance | Viseme Recognition

Post Info

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Jacob L. Newman, Stephen J. Cox
