Sciweavers

ICASSP
2009
IEEE

Probablistic modelling of F0 in unvoiced regions in HMM based speech synthesis

HMM-based synthesis has attracted great interest due to its compact and flexible modelling of spectral and prosodic parameters. In this approach, short-term spectra, fundamental frequency (F0) and duration are simultaneously modelled by multi-stream HMMs. However, since F0 values in unvoiced regions are normally considered undefined, it is difficult to use standard HMMs for F0 modelling. The currently preferred solution is to use a multi-space distribution HMM (MSDHMM), in which discrete distributions model the voiced/unvoiced decision and continuous Gaussian distributions model the F0 values within the voiced regions. However, the assumption of undefined unvoiced F0 regions and the special structure of the MSDHMM limit the accurate modelling of F0 patterns. In this paper an alternative is explored whereby unvoiced F0 values are assumed to exist and are modelled within the standard HMM framework using a globally tied dis...
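As a rough illustration of the MSDHMM idea described in the abstract, the sketch below computes the log-likelihood of a single F0 observation for one state with two spaces: a zero-dimensional unvoiced space that carries only a discrete weight, and a one-dimensional voiced space with a Gaussian. The function name, the parameter names and the use of log-F0 values are assumptions made for illustration; this is not the authors' code, and it does not cover the alternative model the paper proposes.

```python
import math
from typing import Optional


def msd_log_likelihood(f0: Optional[float], voiced_weight: float,
                       mean: float, var: float) -> float:
    """Log-likelihood of one F0 observation under a two-space MSD state.

    f0 is None for an unvoiced frame, otherwise a (log-)F0 value.
    voiced_weight is the discrete probability of the voiced space.
    mean and var parameterise the voiced-space Gaussian (hypothetical values).
    """
    if f0 is None:
        # Unvoiced frame: only the discrete space weight contributes.
        return math.log(1.0 - voiced_weight)
    # Voiced frame: space weight times a 1-D Gaussian density, in the log domain.
    log_gauss = -0.5 * (math.log(2.0 * math.pi * var) + (f0 - mean) ** 2 / var)
    return math.log(voiced_weight) + log_gauss


if __name__ == "__main__":
    # Hypothetical state parameters: 80% voiced, log-F0 mean 5.0, variance 0.04.
    print(msd_log_likelihood(None, 0.8, 5.0, 0.04))
    print(msd_log_likelihood(5.1, 0.8, 5.0, 0.04))
```

The voiced/unvoiced decision and the F0 value are scored by the same state output, which is the special structure the abstract refers to: an unvoiced frame never contributes to the Gaussian, so the continuous F0 model only sees voiced data.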
Added 21 May 2010
Updated 21 May 2010
Type Conference
Year 2009
Where ICASSP
Authors Kai Yu, Tomoki Toda, Milica Gasic, Simon Keizer, François Mairesse, Blaise Thomson, Steve Young