Integrating frame-based and segment-based dynamic time warping for unsupervised spoken term detection with spoken queries

13 years 10 months ago

Download mirlab.org

Rapidly increasing quantities of multimedia and spoken content today demand fast and accurate retrieval approaches for convenient browsing. The spoken documents with wide variety of different acoustic and linguistic conditions make supervised training of well-matched acoustic/language models very difﬁcult. Unsupervised methods using frame-based dynamic time warping (DTW) require no acoustic/language models but with high computation load. Therefore, segment-based DTW was proposed to relieve the computation load at the cost of degraded detection performance. In this paper, we reﬁne the segment-based DTW by allowing deletion of end segments of query to improve detection performance. The search space is also reduced by segment similarity constraints. We also proposed a two-pass framework. The segment-baed DTW is performed in the ﬁrst pass to locate hypothesized spoken term region and the frame-based DTW for precise rescoring in the second pass. Then the pseudo relevance feedback is ...

Chun-an Chan, Lin-Shan Lee

Real-time Traffic

Computation Load | Detection Performance | ICASSP 2011 | Segment-based Dtw | Signal Processing |

claim paper

Post Info
More Details (n/a)

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Chun-an Chan, Lin-Shan Lee

Comments (0)

Sciweavers

Integrating frame-based and segment-based dynamic time warping for unsupervised spoken term detection with spoken queries

Computation Load | Detection Performance | ICASSP 2011 | Segment-based Dtw | Signal Processing |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers