Model-based dereverberation in the logmelspec domain for robust distant-talking speech recognition

14 years 1 months ago

Download www.lms.lnt.de

The REMOS (REverberation MOdeling for Speech recognition) concept for reverberation-robust distant-talking speech recognition, introduced in [1] for melspectral features, is extended in this contribution to logarithmic melspectral (logmelspec) features. Based on a combined acoustic model consisting of a hidden Markov model network and a reverberation model, REMOS determines clean-speech and reverberation estimates during recognition by an inner optimization operation. A reformulation of this inner optimization problem for logmelspec features, allowing an efficient solution by nonlinear optimization algorithms, is derived in this paper so that an efficient implementation of REMOS for logmelspec features becomes possible. Connected digit recognition experiments show that the proposed REMOS implementation significantly outperforms reverberantlytrained HMMs in highly reverberant environments.

Armin Sehr, Roland Maas, Walter Kellermann

Real-time Traffic

ICASSP 2010 | Inner Optimization | Reverberation | Signal Processing | Speech Recognition |

claim paper

Post Info
More Details (n/a)

Added	11 Feb 2011
Updated	11 Feb 2011
Type	Journal
Year	2010
Where	ICASSP
Authors	Armin Sehr, Roland Maas, Walter Kellermann

Comments (0)

Sciweavers

Model-based dereverberation in the logmelspec domain for robust distant-talking speech recognition

ICASSP 2010 | Inner Optimization | Reverberation | Signal Processing | Speech Recognition |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers