Learning vocal tract variables with multi-task kernels

14 years 10 months ago

Download www.grappa.univ-lille3.fr

The problem of acoustic-to-articulatory speech inversion continues to be a challenging research problem which signiﬁcantly impacts automatic speech recognition robustness and accuracy. This paper presents a multi-task kernel based method aimed at learning Vocal Tract (VT) variables from the Mel-Frequency Cepstral Coefﬁcients (MFCCs). Unlike usual speech inversion techniques based on individual estimation of each tract variable, the key idea here is to consider all the target variables simultaneously to take advantage of the relationships among them and then improve learning performance. The proposed method is evaluated using synthetic speech dataset and corresponding tract variables created by the TAsk Dynamics Application (TADA) model and compared to the hierarchical ε-SVR speech inversion technique.

Hachem Kadri, Emmanuel Duflos, Philippe Preux

Real-time Traffic

ICASSP 2011 | Signal Processing | Speech Inversion Technique | Speech Recognition Robustness | Usual Speech Inversion |

claim paper

» Speakeradaptive learning of resonance targets in a hidden trajectory model of speech coart...

» Use of VTLwise models in featuremapping framework to achieve performance of multiplebackgr...

Post Info
More Details (n/a)

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Hachem Kadri, Emmanuel Duflos, Philippe Preux

Comments (0)

Sciweavers

Learning vocal tract variables with multi-task kernels

ICASSP 2011 | Signal Processing | Speech Inversion Technique | Speech Recognition Robustness | Usual Speech Inversion |

Explore & Download

Productivity Tools

Sciweavers