Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments

13 years 6 months ago

Download www.utdallas.edu

In the presence of environmental noise, speakers tend to adjust their speech production in an effort to preserve intelligible communication. The noise-induced speech adjustments, called Lombard effect (LE), are known to severely impact the accuracy of automatic speech recognition (ASR) systems. The reduced performance results from the mismatch between the ASR acoustic models trained typically on noise-clean neutral (modal) speech and the actual parameters of noisy LE speech. In this study, novel unsupervised frequency domain and cepstral domain equalizations that increase ASR resistance to LE are proposed and incorporated in a recognition scheme employing a codebook of noisy acoustic models. In the frequency domain, short-time speech spectra are transformed towards neutral ASR acoustic models in a maximum likelihood fashion. Simultaneously, dynamics of cepstral samples are determined from the quantile estimates and normalized to a constant range. A codebook decoding strategy is applied...

Hynek Boril, John H. L. Hansen

Real-time Traffic

Acoustic Models | ASR Acoustic Models | LE Speech | Software Engineering | TASLP 2010 |

claim paper

Post Info
More Details (n/a)

Added	21 May 2011
Updated	21 May 2011
Type	Journal
Year	2010
Where	TASLP
Authors	Hynek Boril, John H. L. Hansen

Comments (0)

Sciweavers

Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments

Acoustic Models | ASR Acoustic Models | LE Speech | Software Engineering | TASLP 2010 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers