Personalising Speech-To-Speech Translation in the EMIME Project

14 years 1 months ago

Download aclweb.org

In the EMIME project we have studied unsupervised cross-lingual speaker adaptation. We have employed an HMM statistical framework for both speech recognition and synthesis which provides transformation mechanisms to adapt the synthesized voice in TTS (text-to-speech) using the recognized voice in ASR (automatic speech recognition). An important application for this research is personalised speech-to-speech translation that will use the voice of the speaker in the input language to utter the translated sentences in the output language. In mobile environments this enhances the users' interaction across language barriers by making the output speech sound more like the original speaker's way of speaking, even if she or he could not speak the output language.

Mikko Kurimo, William Byrne, John Dines, Philip N.

Real-time Traffic

ACL 2010 | Computational Linguistics | HMM Statistical Framework | Output Language | Unsupervised Cross-lingual Speaker |

claim paper

Post Info
More Details (n/a)

Added	10 Feb 2011
Updated	10 Feb 2011
Type	Journal
Year	2010
Where	ACL
Authors	Mikko Kurimo, William Byrne, John Dines, Philip N. Garner, Matthew Gibson, Yong Guan, Teemu Hirsimäki, Reima Karhila, Simon King, Hui Liang, Keiichiro Oura, Lakshmi Saheer, Matt Shannon, Sayaki Shiota, Jilei Tian

Comments (0)

Sciweavers

Personalising Speech-To-Speech Translation in the EMIME Project

ACL 2010 | Computational Linguistics | HMM Statistical Framework | Output Language | Unsupervised Cross-lingual Speaker |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers