Combining LVCSR and vocabulary-independent ranked utterance retrieval for robust speech search



Well tuned Large-Vocabulary Continuous Speech Recognition (LVCSR) has been shown to generally be more effective than vocabulary-independent techniques for ranked retrieval of spoken content when one or the other approach is used alone. Tuning LVCSR systems to a topic domain can be costly, however, and the experiments in this paper show that Out-Of-Vocabulary (OOV) query terms can significantly reduce retrieval effectiveness when that tuning is not performed. Further experiments demonstrate, however, that retrieval effectiveness for queries with OOV terms can be substantially improved by combining evidence from LVCSR with additional evidence from vocabulary-independent Ranked Utterance Retrieval (RUR). The combination is performed by using relevance judgments from held-out topics to learn generic (i.e., topic-independent), smooth, non-decreasing transformations from LVCSR and RUR system scores to probabilities of topical relevance. Evaluated using a CLEF collection that includes top...
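The score-to-probability idea in the abstract can be illustrated with a small sketch. It is not the authors' method: the abstract does not say how the smooth transformations are learned or how the two probabilities are merged. As stand-ins, this sketch fits a non-decreasing step function with pool-adjacent-violators (isotonic regression, which is monotone but not smooth) and merges the two estimates with a noisy-OR rule. The function names, the toy scores, and the relevance labels are all hypothetical.

```python
from bisect import bisect_right


def fit_monotone_map(scores, labels):
    """Fit a non-decreasing step function from scores to relevance rates.

    Uses pool-adjacent-violators on held-out (score, 0/1 label) pairs.
    Returns (thresholds, values): values[i] applies from thresholds[i] upward.
    """
    # Pool identical scores first so each threshold is unique.
    grouped = {}
    for s, y in zip(scores, labels):
        total, count = grouped.get(s, (0.0, 0))
        grouped[s] = (total + y, count + 1)

    blocks = []  # each block: [label_sum, count, lowest_score_in_block]
    for s in sorted(grouped):
        total, count = grouped[s]
        blocks.append([total, count, s])
        # Merge backwards while an earlier block's mean exceeds a later one's.
        while (len(blocks) > 1
               and blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
            last_total, last_count, _ = blocks.pop()
            blocks[-1][0] += last_total
            blocks[-1][1] += last_count

    thresholds = [b[2] for b in blocks]
    values = [b[0] / b[1] for b in blocks]
    return thresholds, values


def apply_map(model, score):
    """Look up the estimated probability of relevance for a raw score."""
    thresholds, values = model
    i = bisect_right(thresholds, score) - 1
    return values[max(i, 0)]  # scores below the first threshold use the first value


def combine_noisy_or(p_lvcsr, p_rur):
    """Assumed combination rule: relevant if either source indicates relevance."""
    return 1.0 - (1.0 - p_lvcsr) * (1.0 - p_rur)


# Hypothetical held-out scores and relevance judgments for one system.
held_out_scores = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6]
held_out_labels = [0, 0, 1, 0, 1, 1]
model = fit_monotone_map(held_out_scores, held_out_labels)
```

In use, one such map would be fitted per system (one for LVCSR scores, one for RUR scores) on held-out topics, and `combine_noisy_or(apply_map(lvcsr_model, s1), apply_map(rur_model, s2))` would give a single ranking score per utterance. A smoothed fit, as the abstract describes, would replace the step function here.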

J. Scott Olsson, Douglas W. Oard


Information Retrieval | Large-Vocabulary Continuous Speech | Relevance Judgments | Retrieval Effectiveness | SIGIR 2009


Post Info

Added	28 May 2010
Updated	28 May 2010
Type	Conference
Year	2009
Where	SIGIR
Authors	J. Scott Olsson, Douglas W. Oard


Sciweavers
