For the task of detecting shouted speech in a noisy environment, this paper introduces a system based on mel frequency cepstral coefficient (MFCC) feature extraction, unsupervise...
This paper addresses the detection of OOV segments in the output of a large vocabulary continuous speech recognition (LVCSR) system. First, standard confidence measures based on fr...
Lukas Burget, Petr Schwarz, Pavel Matejka, Mirko H...
Polish is a synthetic language with a high morpheme-per-word ratio. It makes use of a high degree of inflection, leading to high out-of-vocabulary (OOV) rates, and high Language Mo...
M. Ali Basha Shaik, Amr El-Desoky Mousa, Ralf Schl...
The Informedia Experience-on-Demand system uses speech, image, and natural language processing combined with GPS information to capture, integrate, and communicate personal multim...
Howard D. Wactlar, Michael G. Christel, Alexander ...
Automatic Speech Recognition (ASR) systems continue to make errors during search when handling various phenomena including noise, pronunciation variation, and out-of-vocabulary (O...
Christopher M. White, Geoffrey Zweig, Lukas Burget...