Rapid development of speech translation using consecutive interpretation

13 years 8 months ago

Download www.makapa.de

The development of a speech translation (ST) system is costly, largely because it is expensive to collect parallel data. A new language pair is typically only considered in the aftermath of an international crisis that incurs a major need of crosslingual communication. Urgency justifies the deployment of interpreters while data is being collected. In recent work, we have shown that audio recordings of interpreter-mediated communication can present a low-cost data resource for the rapid development of automatic text and speech translation. However, our previous experiments remain limited to English/Spanish simultaneous interpretation. In this work, we examine our approaches for exploiting interpretation audio as translation model training data in the context of English/Pashto consecutive interpretation. We show that our previously made findings remain valid, despite the more complex language pair and the additional challenges introduced by the strong resource-limitations of Pashto.

Matthias Paulik, Alex Waibel

Real-time Traffic

INTERSPEECH 2010 | Language Pair | Low-cost Data Resource | Signal Processing | Speech Translation |

claim paper

Post Info
More Details (n/a)

Added	19 May 2011
Updated	19 May 2011
Type	Journal
Year	2010
Where	INTERSPEECH
Authors	Matthias Paulik, Alex Waibel

Comments (0)

Sciweavers

Rapid development of speech translation using consecutive interpretation

INTERSPEECH 2010 | Language Pair | Low-cost Data Resource | Signal Processing | Speech Translation |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers