Sciweavers

SIGIR
2000
ACM

Effects of out of vocabulary words in spoken document retrieval

14 years 3 months ago
Effects of out of vocabulary words in spoken document retrieval
The effects of out-of-vocabulary (OOV) items in spoken document retrieval (SDR) are investigated. Several sets of transcriptions were created for the TREC-8 SDR task using a speech recognition system varying the vocabulary sizes and OOV rates, and the relative retrieval performance measured. The effects of OOV terms on a simple baseline IR system and on more sophisticated retrieval systems are described. The use of a parallel corpus for query and document expansion is found to be especially beneficial, and with this data set, good retrieval performance can be achieved even for fairly high OOV rates.
Philip C. Woodland, Sue E. Johnson, P. Jourlin, Ka
Added 25 Aug 2010
Updated 25 Aug 2010
Type Conference
Year 2000
Where SIGIR
Authors Philip C. Woodland, Sue E. Johnson, P. Jourlin, Karen Sparck Jones
Comments (0)