Automatic speech recognition system channel modeling

15 years 1 months ago

Download www-scf.usc.edu

In this paper, we present a systems approach for channel modeling of an Automatic Speech Recognition (ASR) system. This can have implications in improving speech recognition components, such as through discriminative language modeling. We simulate the ASR corruption using a phrase-based machine translation system trained between the reference phoneme and output phoneme sequences of a real ASR. We demonstrate that local optimization on the quality of phoneme-to-phoneme mappings does not directly translate to overall improvement of the entire model. However, we are still able to capitalize on contextual information of the phonemes which a simple acoustic distance model is not able to accomplish. Hence we show that the use of longer context results in a significantly improved model of the ASR channel.

Qun Feng Tan, Kartik Audhkhasi, Panayiotis G. Geor

Real-time Traffic

Automatic Speech Recognition | INTERSPEECH 2010 | Output Phoneme Sequences | Signal Processing | Speech Recognition |

claim paper

» Automatic speech recognition performance on a voicemail transcription task

» Onthefly lattice rescoring for realtime automatic speech recognition

» Thai spelling analysis for automatic spelling speech recognition

» Speech Recognition Model for Tamil Stops

» Minimum variance modulation filter for robust speech recognition

» Speech Recognition System of Arabic Digits based on A Telephony Arabic Corpus

» Automatic speech recognition for assistive writing in speech supplemented word prediction

» Lowcomplexity automatic speaker recognition in the compressed GSM AMR domain

Post Info
More Details (n/a)

Added	18 May 2011
Updated	18 May 2011
Type	Journal
Year	2010
Where	INTERSPEECH
Authors	Qun Feng Tan, Kartik Audhkhasi, Panayiotis G. Georgiou, Emil Ettelaie, Shrikanth S. Narayanan

Comments (0)

Sciweavers

Automatic speech recognition system channel modeling

Automatic Speech Recognition | INTERSPEECH 2010 | Output Phoneme Sequences | Signal Processing | Speech Recognition |

Explore & Download

Productivity Tools

Sciweavers