Semi-automated update of automatic transcription system for the Japanese national congress

15 years 1 months ago

Download www.ar.media.kyoto-u.ac.jp

Update of acoustic and language models is vital to maintain performance of automatic speech recognition (ASR) systems. To alleviate efforts for updating models, we propose a "semi-automated" framework for the ASR system of the Japanese National Congress. The framework consists of our speaking-style transformation (SST) and lightly-supervised training (LSV) approaches, which can automatically generate spoken-style training texts and labels from documents like meeting minutes. An experimental evaluation demonstrated that this update framework improved the ASR performance for the latest meeting data. We also address an estimation method of the ASR accuracy based on SST, which uses minutes as reference texts and does not require verbatim transcripts.

Yuya Akita, Masato Mimura, Graham Neubig, Tatsuya

Real-time Traffic

ASR Accuracy | Automatic Speech Recognition | INTERSPEECH 2010 | Japanese National Congress | Signal Processing |

claim paper

Added	18 May 2011
Updated	18 May 2011
Type	Journal
Year	2010
Where	INTERSPEECH
Authors	Yuya Akita, Masato Mimura, Graham Neubig, Tatsuya Kawahara

Sciweavers

Semi-automated update of automatic transcription system for the Japanese national congress

ASR Accuracy | Automatic Speech Recognition | INTERSPEECH 2010 | Japanese National Congress | Signal Processing |

Explore & Download

Productivity Tools

Sciweavers