LSF mapping for voice conversion with very small training sets

15 years 9 months ago

Download www.cs.tut.fi

To make voice conversion usable in practical applications, the number of training sentences should be minimized. With traditional Gaussian mixture model (GMM) based techniques small training sets lead to over-ﬁtting and estimation problems. We propose a new approach for mapping line spectral frequencies (LSFs) representing the vocal tract. The idea is based on inherent intra-frame correlations of LSFs. For each target LSF, a separate GMM is used and only the source and target LSF elements best correlating with the current LSF are used in training. The proposed method is evaluated both objectively and in listening tests, and it is shown that the method outperforms the conventional GMM approach especially with very small training sets.

Elina Helander, Jani Nurminen, Moncef Gabbouj

Real-time Traffic

ICASSP 2008 | Signal Processing | Small Training Sets | Target Lsf | Target Lsf Elements |

claim paper

Post Info
More Details (n/a)

Added	30 May 2010
Updated	30 May 2010
Type	Conference
Year	2008
Where	ICASSP
Authors	Elina Helander, Jani Nurminen, Moncef Gabbouj

Comments (0)

Sciweavers

LSF mapping for voice conversion with very small training sets

ICASSP 2008 | Signal Processing | Small Training Sets | Target Lsf | Target Lsf Elements |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers