The role of higher-level linguistic features in HMM-based speech synthesis

15 years 1 months ago

Download www.cstr.ed.ac.uk

We analyse the contribution of higher-level elements of the linguistic specification of a data-driven speech synthesiser to the naturalness of the synthetic speech which it generates. The system is trained using various subsets of the full feature-set, in which features relating to syntactic category, intonational phrase boundary, pitch accent and boundary tones are selectively removed. Utterances synthesised by the different configurations of the system are then compared in a subjective evaluation of their naturalness. The work presented forms background analysis for an ongoing set of experiments in performing text-to-speech (TTS) conversion based on shallow features: features that can be trivially extracted from text. By building a range of systems, each assuming the availability of a different level of linguistic annotation, we obtain benchmarks for our on-going work.

Oliver Watts, Junichi Yamagishi, Simon King

Real-time Traffic

Boundary Tones | Data-driven Speech Synthesiser | INTERSPEECH 2010 | Linguistic Specification | Signal Processing |

claim paper

Post Info
More Details (n/a)

Added	18 May 2011
Updated	18 May 2011
Type	Journal
Year	2010
Where	INTERSPEECH
Authors	Oliver Watts, Junichi Yamagishi, Simon King

Comments (0)

Sciweavers

The role of higher-level linguistic features in HMM-based speech synthesis

Boundary Tones | Data-driven Speech Synthesiser | INTERSPEECH 2010 | Linguistic Specification | Signal Processing |

Explore & Download

Productivity Tools

Sciweavers