Sciweavers

TASLP
2010

Hierarchical Bayesian Language Models for Conversational Speech Recognition

13 years 7 months ago
Hierarchical Bayesian Language Models for Conversational Speech Recognition
Traditional n-gram language models are widely used in state-of-the-art large vocabulary speech recognition systems. This simple model suffers from some limitations, such as overfitting of maximum likelihood estimation and the lack of rich contextual knowledge sources. In this paper, we exploit a hierarchical Bayesian interpretation for language modeling, based on a nonparametric prior called PitmanYor process. This offers a principled approach to language model smoothing, embedding the power-law distribution for natural language. Experiments on the recognition of conversational speech in multiparty meetings demonstrate that by using hierarchical Bayesian language models, we are able to achieve significant reductions in perplexity and word error rate.
Songfang Huang, Steve Renals
Added 21 May 2011
Updated 21 May 2011
Type Journal
Year 2010
Where TASLP
Authors Songfang Huang, Steve Renals
Comments (0)