Sciweavers

ICASSP
2010
IEEE

Balancing false alarms and hits in Spoken Term Detection

13 years 11 months ago
Balancing false alarms and hits in Spoken Term Detection
This paper presents methods to improve retrieval of Out-OfVocabulary (OOV) terms in a Spoken Term Detection (STD) system. We demonstrate that automated tagging of OOV regions helps to reduce false alarms while incorporating phonetic confusability increases the hits. Additional features that boost the probability of a hit in accordance with the number of neighboring hits for the same query and query-length normalization also improve the overall performance of the spokenterm detection system. We show that these methods can be combined effectively to provide a relative improvement of 21% in Average Term Weighted Value (ATWV) on a 100hour corpus with 1290 OOV-only queries and 2% relative on the NIST 2006 STD task, where only 16 of the 1107 queries were OOV terms. Lastly, we present results to show that the proposed methods are general enough to work well in queryby-example based spoken-term detection, and in mismatched situations when the representation of the index being searched through...
Carolina Parada, Abhinav Sethy, Bhuvana Ramabhadra
Added 06 Dec 2010
Updated 06 Dec 2010
Type Conference
Year 2010
Where ICASSP
Authors Carolina Parada, Abhinav Sethy, Bhuvana Ramabhadran
Comments (0)