Cross-language bootstrapping based on completely unsupervised training using multilingual A-stabil

13 years 8 months ago

Download mirlab.org

This paper presents our work on rapid language adaptation of acoustic models based on multilingual cross-language bootstrapping and unsupervised training. We used Automatic Speech Recognition (ASR) systems in English, French, German, and Spanish to build a Czech ASR system from scratch. System building was performed without using any transcribed audio data by applying three consecutive steps, i.e. cross-language transfer, unsupervised training based on the “multilingual A-stabil“ conﬁdence score [1], and bootstrapping. Based on the conﬁdence score we selected 72% (16.6 hours) of the available audio data with a transcription WER of less than 14.5%. The cross-language bootstrap achieves a word error rate of 23.3% on the Czech development set and 22.4% on the evaluation set. These results are very promising as the performance compares favorably to the Czech ASR system which was trained on 23 hours of

Ngoc Thang Vu, Franziska Kraus, Tanja Schultz

Real-time Traffic

Conﬁdence Score | Czech ASR | ICASSP 2011 | Signal Processing | Unsupervised Training |

claim paper

Post Info
More Details (n/a)

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Ngoc Thang Vu, Franziska Kraus, Tanja Schultz

Comments (0)

Sciweavers

Cross-language bootstrapping based on completely unsupervised training using multilingual A-stabil

Conﬁdence Score | Czech ASR | ICASSP 2011 | Signal Processing | Unsupervised Training |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers