Contextual Information Improves OOV Detection in Speech

13 years 10 months ago

Download www.cs.jhu.edu

Out-of-vocabulary (OOV) words represent an important source of error in large vocabulary continuous speech recognition (LVCSR) systems. These words cause recognition failures, which propagate through pipeline systems impacting the performance of downstream applications. The detection of OOV regions in the output of a LVCSR system is typically addressed as a binary classification task, where each region is independently classified using local information. In this paper, we show that jointly predicting OOV regions, and including contextual information from each region, leads to substantial improvement in OOV detection. Compared to the state-of-the-art, we reduce the missed OOV rate from 42.6% to 28.4% at 10% false alarm rate.

Carolina Parada, Mark Dredze, Denis Filimonov, Fre

Real-time Traffic

Binary Classification Task | Computational Linguistics | False Alarm Rate | NAACL 2010 | OOV Regions |

claim paper

Post Info
More Details (n/a)

Added	14 Feb 2011
Updated	14 Feb 2011
Type	Journal
Year	2010
Where	NAACL
Authors	Carolina Parada, Mark Dredze, Denis Filimonov, Frederick Jelinek

Comments (0)

Sciweavers

Contextual Information Improves OOV Detection in Speech

Binary Classification Task | Computational Linguistics | False Alarm Rate | NAACL 2010 | OOV Regions |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers