HMM-based sequence-to-frame mapping for voice conversion

15 years 7 months ago

Download www.gavo.t.u-tokyo.ac.jp

Voice conversion can be reduced to a problem to ﬁnd a transformation function between the corresponding speech sequences of two speakers. Perhaps the most voice conversions methods are GMMbased statistical mapping methods [1, 2]. However, the classical GMM-based mapping is frame-to-frame, and cannot take account of the contextual information existing over a speech sequence. It is well known that HMM yields an efﬁcient method to model the density of a whole speech sequence and has found great successes in speech recognition and synthesis. Inspired by this fact, this paper studies how to use HMM for voice conversion. We derive an HMMbased sequence-to-frame mapping function with statistical analysis. Different from previous HMM-based voice conversion methods [3, 4, 5] that used forced alignment for segmentation and transform frames aligned to a state with its associated linear transformation, our method has a soft mapping function as a weighted summation of linear transformations. Th...

Yu Qiao, Daisuke Saito, Nobuaki Minematsu

Real-time Traffic

ICASSP 2010 | Mapping Functions | Signal Processing | Speech Sequence | Voice Conversion |

claim paper

Post Info
More Details (n/a)

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Yu Qiao, Daisuke Saito, Nobuaki Minematsu

Comments (0)

Sciweavers

HMM-based sequence-to-frame mapping for voice conversion

ICASSP 2010 | Mapping Functions | Signal Processing | Speech Sequence | Voice Conversion |

Explore & Download

Productivity Tools

Sciweavers