Emotional Speech Synthesis using Subspace Constraints in Prosody

14 years 6 months ago

Download www.mega.t-kougei.ac.jp

An efﬁcient speech synthesis method that uses subspace constraint in prosody is proposed. Conventional unit selection methods concatenate speech segments stored in database, that require enormous number of waveforms in synthesizing various emotional expressions with arbitrary texts. The proposed method employs principal component analysis to reduce the dimensionality of prosodic components, that also allows us to generate new speech that are similar to training samples. The subspace constraint assures that the prosody of the synthesized speech including F0, power, and speech length hold their correlative relation that training samples of emotional speech have. We assume that the combination of the number of syllables and the accent type determines the correlative dynamics of prosody, for each of which we individually construct the subspace. The subspace is then linearly related to emotions by multiple regression analysis that are obtained by subjective evaluation for the training sa...

Shinya Mori, Tsuyoshi Moriyama, Shinji Ozawa

Real-time Traffic

ICMCS 2006 | Methods Concatenate Speech | Subspace Constraint | Training Samples |

claim paper

Post Info
More Details (n/a)

Added	11 Jun 2010
Updated	11 Jun 2010
Type	Conference
Year	2006
Where	ICMCS
Authors	Shinya Mori, Tsuyoshi Moriyama, Shinji Ozawa

Comments (0)

Sciweavers

Emotional Speech Synthesis using Subspace Constraints in Prosody

ICMCS 2006 | Methods Concatenate Speech | Subspace Constraint | Training Samples |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers