Improved statistical models for SMT-based speaking style transformation

15 years 7 months ago

Download www.ar.media.kyoto-u.ac.jp

Automatic speech recognition (ASR) results contain not only ASR errors, but also disﬂuencies and colloquial expressions that must be corrected to create readable transcripts. We take the approach of statistical machine translation (SMT) to “translate” from ASR results into transcript-style text. We introduce two novel modeling techniques in this framework: a context-dependent translation model, which allows for usage of context to accurately model translation probabilities, and log-linear interpolation of conditional and joint probabilities, which allows for frequently observed translation patterns to be given higher priority. The system is implemented using weighted ﬁnite state transducers (WFST). On an evaluation using ASR results and manual transcripts of meetings of the Japanese Diet (national congress), the proposed methods showed a signiﬁcant increase in accuracy over traditional modeling techniques.

Graham Neubig, Yuya Akita, Shinsuke Mori, Tatsuya

Real-time Traffic

ASR Errors | ICASSP 2010 | Modeling Techniques | Signal Processing | Statistical Machine Translation |

claim paper

Post Info
More Details (n/a)

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Graham Neubig, Yuya Akita, Shinsuke Mori, Tatsuya Kawahara

Comments (0)

Sciweavers

Improved statistical models for SMT-based speaking style transformation

ASR Errors | ICASSP 2010 | Modeling Techniques | Signal Processing | Statistical Machine Translation |

Explore & Download

Productivity Tools

Sciweavers