Sciweavers

IRAL
2000
ACM

Content-based language models for spoken document retrieval

14 years 5 months ago
Content-based language models for spoken document retrieval
Spoken document retrieval (SDR) has been extensively studied in recent years because of its potential use in navigating large multimedia collections in the near future. This paper presents a novel concept of applying content-based language models to spoken document retrieval. In an example task for retrieval of Mandarin Chinese broadcast news data, the content-based language models either trained on automatic transcriptions of spoken documents or adapted from baseline language models using automatic transcriptions of spoken documents were used to create more accurate recognition results and indexing terms from both spoken documents and speech queries. We report on some interesting findings obtained in this research.
Hsin-Min Wang, Berlin Chen
Added 01 Aug 2010
Updated 01 Aug 2010
Type Conference
Year 2000
Where IRAL
Authors Hsin-Min Wang, Berlin Chen
Comments (0)