Sciweavers

ICASSP
2011
IEEE

Preserve ordering property of generated LSPS for minimum generation error training in HMM-based speech synthesis

13 years 3 months ago
Preserve ordering property of generated LSPS for minimum generation error training in HMM-based speech synthesis
Ordering property is an important property of LSP and closely connected with the naturalness of reconstructed speech. When LSP is adopted as spectrum feature in HMM-based parametric speech synthesis, the ordering property cannot be guaranteed because diagonal covariance matrix is used in conventional system and the crossdimension correlation of LSP vector is ignored. It will cause unstable issue in synthesized speech. In this paper, we propose some methods to preserve the ordering property of generated LSPs for MGE training by introducing mis-ordering related distance measurements into model training criterion. Experimental results show that two methods can alleviate the mis-orderings significantly without degrading the MGE performance, and one of which, the minimum mis-ordering counting method, requires no acoustic observations for model optimization.
Ming Lei, Zhen-Hua Ling, Li-Rong Dai
Added 21 Aug 2011
Updated 21 Aug 2011
Type Journal
Year 2011
Where ICASSP
Authors Ming Lei, Zhen-Hua Ling, Li-Rong Dai
Comments (0)