Speech exhibits the property that the unit preceding a pause tends to lengthen. This work uses a dynamic Bayesian network to model this prepausal lengthening effect for robust speech recognition. Specifically, we introduce two distributions to model inter-state transitions in prepausal and non-prepausal words, respectively. The choice between the two transition distributions is governed by a random variable whose value depends on whether a pause occurs between the current word and the following one. Two experiments are presented. The first considers pauses hypothesised during speech decoding; the second employs an additional speech/non-speech detection component. By modelling the prepausal lengthening effect we achieve a 5.5% relative reduction in word error rate on the 500-word task of the SVitchboard corpus.
Ning Ma, Chris Bartels, Jeff A. Bilmes, Phil Green
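The switching of transition distributions by a pause indicator can be pictured with a minimal sketch. This is not the authors' implementation (which uses a dynamic Bayesian network over sub-word states); the function names, structure, and probability values below are assumptions chosen purely for illustration, with higher self-transition probability in the prepausal case producing longer state durations.

```python
import random

# Assumed self-transition probabilities for illustration only: the higher
# prepausal value makes states persist longer, mimicking prepausal lengthening.
SELF_TRANSITION = {
    "non_prepausal": 0.60,  # assumed value
    "prepausal":     0.80,  # assumed value: lengthened durations
}


def sample_state_durations(num_states, followed_by_pause, rng=None):
    """Sample how many frames each sub-word state occupies.

    The inter-state transition distribution is switched by a binary
    variable indicating whether a pause follows the current word,
    echoing the two transition distributions described in the abstract.
    """
    rng = rng or random.Random(0)
    condition = "prepausal" if followed_by_pause else "non_prepausal"
    p_stay = SELF_TRANSITION[condition]
    durations = []
    for _ in range(num_states):
        frames = 1
        while rng.random() < p_stay:  # geometric duration model per state
            frames += 1
        durations.append(frames)
    return durations


if __name__ == "__main__":
    print("non-prepausal word:", sample_state_durations(3, followed_by_pause=False))
    print("prepausal word:    ", sample_state_durations(3, followed_by_pause=True))
```

Under these assumed parameters, the prepausal word tends to draw longer per-state durations, which is the effect the switched transition distributions are intended to capture.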