HMM-based speech synthesiser using the LF-model of the glottal source

14 years 10 months ago

Download www.cstr.ed.ac.uk

A major factor which causes a deterioration in speech quality in HMM-based speech synthesis is the use of a simple delta pulse signal to generate the excitation of voiced speech. This paper sets out a new approach to using an acoustic glottal source model in HMM-based synthesisers instead of the traditional pulse signal. The goal is to improve speech quality and to better model and transform voice characteristics. We have found the new method decreases buzziness and also improves prosodic modelling. A perceptual evaluation has supported this ﬁnding by showing a 55.6% preference for the new system, as against the baseline. This improvement, while not being as signiﬁcant as we had initially expected, does encourage us to work on developing the proposed speech synthesiser further.

João P. Cabral, Steve Renals, Junichi Yamag

Real-time Traffic

Delta Pulse Signal | HMM-based Speech Synthesis | ICASSP 2011 | Pulse Signal | Signal Processing |

claim paper

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	João P. Cabral, Steve Renals, Junichi Yamagishi, Korin Richmond

Sciweavers

HMM-based speech synthesiser using the LF-model of the glottal source

Delta Pulse Signal | HMM-based Speech Synthesis | ICASSP 2011 | Pulse Signal | Signal Processing |

Explore & Download

Productivity Tools

Sciweavers