Sciweavers

NAACL
2010

Contextual Information Improves OOV Detection in Speech

13 years 10 months ago
Contextual Information Improves OOV Detection in Speech
Out-of-vocabulary (OOV) words represent an important source of error in large vocabulary continuous speech recognition (LVCSR) systems. These words cause recognition failures, which propagate through pipeline systems impacting the performance of downstream applications. The detection of OOV regions in the output of a LVCSR system is typically addressed as a binary classification task, where each region is independently classified using local information. In this paper, we show that jointly predicting OOV regions, and including contextual information from each region, leads to substantial improvement in OOV detection. Compared to the state-of-the-art, we reduce the missed OOV rate from 42.6% to 28.4% at 10% false alarm rate.
Carolina Parada, Mark Dredze, Denis Filimonov, Fre
Added 14 Feb 2011
Updated 14 Feb 2011
Type Journal
Year 2010
Where NAACL
Authors Carolina Parada, Mark Dredze, Denis Filimonov, Frederick Jelinek
Comments (0)