Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model


Download spalab.naist.jp

In this paper, we describe a statistical approach to both articulatory-to-acoustic mapping and acoustic-to-articulatory inversion mapping that does not use phonetic information. The joint probability density of an articulatory parameter and an acoustic parameter is modeled using a Gaussian mixture model (GMM) trained on a parallel acoustic-articulatory speech database. We apply the GMM-based mapping using the minimum mean-square error (MMSE) criterion, which has been proposed for voice conversion, to the two mappings. Moreover, to improve the mapping performance, we apply maximum likelihood estimation (MLE) to the GMM-based mapping method. A target parameter trajectory with appropriate static and dynamic properties is determined by imposing an explicit relationship between static and dynamic features in the MLE-based mapping. Experimental results demonstrate that the MLE-based mapping with dynamic features can significantly improve the mapping performance compared...
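As a rough illustration of the MMSE mapping mentioned in the abstract, the sketch below computes the conditional expectation E[y | x] under a joint GMM. It is a minimal sketch, not the authors' implementation: it assumes scalar source and target features (the paper works with feature vectors), and the function names, the dictionary layout, and the `TOY_GMM` parameters are hypothetical values chosen for illustration rather than anything taken from the paper. The MLE-based mapping with dynamic features is not covered here.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def mmse_map(x, components):
    """Map a scalar source feature x to the MMSE estimate of the target y.

    Each component is a dict holding the parameters of one joint Gaussian
    over (x, y): mixture weight, means mu_x and mu_y, source variance var_x,
    and cross-covariance cov_yx. This layout is an assumption of the sketch.
    """
    # Weighted marginal likelihood of x under each mixture component.
    likelihoods = [
        c["weight"] * gaussian_pdf(x, c["mu_x"], c["var_x"]) for c in components
    ]
    total = sum(likelihoods)

    y_hat = 0.0
    for lik, c in zip(likelihoods, components):
        posterior = lik / total  # P(component | x)
        # Conditional mean of y given x for this component.
        cond_mean = c["mu_y"] + c["cov_yx"] / c["var_x"] * (x - c["mu_x"])
        y_hat += posterior * cond_mean
    return y_hat


# Hypothetical two-component joint GMM (illustrative numbers only).
TOY_GMM = [
    {"weight": 0.5, "mu_x": -1.0, "mu_y": 2.0, "var_x": 1.0, "cov_yx": 0.5},
    {"weight": 0.5, "mu_x": 1.0, "mu_y": -2.0, "var_x": 1.0, "cov_yx": 0.5},
]

if __name__ == "__main__":
    for x in (-1.0, 0.0, 1.0):
        print(x, mmse_map(x, TOY_GMM))
```

The estimate is a posterior-weighted sum of per-component linear regressions, so it varies smoothly between components. For inputs very far from every component mean, `total` can underflow to zero; a practical version would work with log-likelihoods instead.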

Tomoki Toda, Alan W. Black, Keiichi Tokuda

Acoustic-to-articulatory Inversion Mapping | Articulatory-to-acoustic Mapping | Inversion Mapping | SPEECH 2008 |

Post Info
Added	15 Dec 2010
Updated	15 Dec 2010
Type	Journal
Year	2008
Where	SPEECH
Authors	Tomoki Toda, Alan W. Black, Keiichi Tokuda
