Sciweavers

ICASSP
2008
IEEE

Irrelevant variability normalization based HMM training using map estimation of feature transforms for robust speech recognition

14 years 7 months ago
Irrelevant variability normalization based HMM training using map estimation of feature transforms for robust speech recognition
In the past several years, we’ve been studying feature transformation (FT) approaches to robust automatic speech recognition (ASR) which can compensate for possible “distortions” caused by factors irrelevant to phonetic classification in both training and recognition stages. Several FT functions with different degrees of flexibility have been studied and the corresponding maximum likelihood (ML) training techniques developed. In this paper, we study yet another new FT function which takes the most flexible form of frame-dependent linear transformation. Maximum a posteriori (MAP) estimation is used for estimating FT function parameters to deal with the possible problem of insufficient training data caused by the increased number of model parameters. The effectiveness of the proposed approach is confirmed by evaluation experiments on Finnish Aurora3 database.
Donglai Zhu, Qiang Huo
Added 30 May 2010
Updated 30 May 2010
Type Conference
Year 2008
Where ICASSP
Authors Donglai Zhu, Qiang Huo
Comments (0)