Phoneme cluster based state mapping for text-independent voice conversion

15 years 5 months ago

Download nlpr-web.ia.ac.cn

This paper takes phonetic information into account for data alignment in text-independent voice conversion. Hidden Markov Models are used for representing the phonetic structure of training speech. States belonging to same phoneme are grouped together to form a phoneme cluster. A state mapped codebook based transformation is established using information on the corresponding phoneme clusters from source and targets speech and weighted linear transform. For each source vector, several nearest clusters are considered simultaneously while mapping in order to generate a continuous and stable transform. Experimental results indicate that the proposed use of phonetic information increases the similarity between converted speech and target speech. The proposed technique is applicable to both intra-lingual and cross-lingual voice conversion.

Meng Zhang, Jiaohua Tao, Jani Nurminen, Jilei Tian

Real-time Traffic

ICASSP 2009 | Phoneme Clusters | Phonetic Information | Signal Processing | Voice Conversion |

claim paper

Post Info
More Details (n/a)

Added	18 Feb 2011
Updated	18 Feb 2011
Type	Journal
Year	2009
Where	ICASSP
Authors	Meng Zhang, Jiaohua Tao, Jani Nurminen, Jilei Tian, Xia Wang

Comments (0)

Sciweavers

Phoneme cluster based state mapping for text-independent voice conversion

ICASSP 2009 | Phoneme Clusters | Phonetic Information | Signal Processing | Voice Conversion |

Explore & Download

Productivity Tools

Sciweavers