Improved GMM-based language recognition using constrained MLLR transforms

16 years 1 months ago

Download www.ll.mit.edu

In this paper we describe the application of a feature-space transform based on constrained maximum likelihood linear regression for unsupervised compensation of channel and speaker variability to the language recognition problem. We show that use of such transforms can improve baseline GMM-based language recognition performance on the 2005 NIST Language Recognition Evaluation (LRE05) task by 38%. Furthermore, gains from CMLLR are additive with other modeling enhancements such as vocal tract length normalization (VTLN). Further improvement is obtained using discriminative training, and it is shown that a system using only CMLLR adaption produces state-of-the-art accuracy with decreased test-time computational cost than systems using VTLN.

Wade Shen, Douglas A. Reynolds

Real-time Traffic

ICASSP 2008 | Language Recognition | Language Recognition Problem | NIST Language Recognition | Signal Processing |

claim paper

Added	30 May 2010
Updated	30 May 2010
Type	Conference
Year	2008
Where	ICASSP
Authors	Wade Shen, Douglas A. Reynolds

Sciweavers

Improved GMM-based language recognition using constrained MLLR transforms

ICASSP 2008 | Language Recognition | Language Recognition Problem | NIST Language Recognition | Signal Processing |

Explore & Download

Productivity Tools

Sciweavers