Robust speech recognition using dynamic noise adaptation

13 years 3 months ago

Download mirlab.org

Dynamic noise adaptation (DNA) [1, 2] is a model-based technique for improving automatic speech recognition (ASR) performance in noise. DNA has shown promise on artiﬁcially mixed data such as the Aurora II and DNA+Aurora II tasks [1]—signiﬁcantly outperforming well-known techniques like the ETSI AFE and fMLLR [2]—but has never been tried on real data. In this paper, we present new results generated by commercial-grade ASR systems trained on large amounts of data. We show that DNA improves upon the performance of the spectral subtraction (SS) and stochastic fMLLR algorithms of our embedded recognizers, particularly in unseen noise conditions, and describe how DNA has been evolved to become suitable for deployment in low-latency ASR systems. DNA improves our best embedded system, which utilizes SS, fMLLR, and fMPE [3] by over 22% relative at SNRs below 6 dB, reducing the word error rate in these adverse conditions from 4.24% to 3.29%.

Steven J. Rennie, Pierre L. Dognin, Petr Fousek

Real-time Traffic

Asr Systems | Commercial-grade Asr Systems | Dynamic Noise Adaptation | ICASSP 2011 | Signal Processing |

claim paper

Post Info
More Details (n/a)

Added	20 Aug 2011
Updated	20 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Steven J. Rennie, Pierre L. Dognin, Petr Fousek

Comments (0)

Sciweavers

Robust speech recognition using dynamic noise adaptation

Asr Systems | Commercial-grade Asr Systems | Dynamic Noise Adaptation | ICASSP 2011 | Signal Processing |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers