Extended VTS for noise-robust speech recognition

16 years 1 months ago

Download mi.eng.cam.ac.uk

Model compensation is a standard way of improving the robustness of speech recognition systems to noise. A number of popular schemes are based on vector Taylor series (vts) compensation, which uses a linear approximation to represent the influence of noise on the clean speech. To compensate the dynamic parameters, the continuous time approximation is often used. This approximation uses a point estimate of the gradient, which fails to take into account that dynamic coefficients are a function of a number of consecutive static coefficients. In this paper, the accuracy of dynamic parameter compensation is improved by representing the dynamic features as a linear transformation of a window of static features. A modified version of vts compensation is applied to the distribution of the window of static features and, importantly, their correlations. These compensated distributions are then transformed to distributions over standard static and dynamic features. With this improved approximati...

Rogier C. van Dalen, Mark J. F. Gales

Real-time Traffic

Dynamic Features | Dynamic Parameter | ICASSP 2009 | Signal Processing | Static Features |

claim paper

» Structured discriminative models for noise robust continuous speech recognition

» Factor analysis based VTS and JUD noise estimation and compensation

» Normalized Training for HMMBased Visual Speech Recognition

» Acoustic model adaptation via Linear Spline Interpolation for robust speech recognition

Post Info
More Details (n/a)

Added	21 May 2010
Updated	21 May 2010
Type	Conference
Year	2009
Where	ICASSP
Authors	Rogier C. van Dalen, Mark J. F. Gales

Comments (0)

Sciweavers

Extended VTS for noise-robust speech recognition

Dynamic Features | Dynamic Parameter | ICASSP 2009 | Signal Processing | Static Features |

Explore & Download

Productivity Tools

Sciweavers