We present a system that retrieves all clips from a meeting archive showing a particular individual speaking, given a single face or voice sample as the query. The system incorporates three novel ideas. First, rather than matching the query against each sample in the archive individually, the samples within a meeting are grouped beforehand, yielding one cluster of samples per individual. The query is then matched against the cluster, exploiting the multiple samples to yield a robust decision. Second, automatic audio-visual association is performed, which enables bi-modal retrieval of clips even when the query is uni-modal. Third, the biometric recognition uses individual-specific score distributions learnt from the clusters within a likelihood-ratio-based decision framework, which obviates the need for explicit score normalization or modality weighting. The resulting system, which is fully automated, achieves 92.6% precision at 90% recall on a dataset of 16 real meetings spanning a total of 13 hours.
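As an illustrative sketch of the third idea (the notation below is ours, not the paper's), a likelihood-ratio formulation combines the two modalities by multiplying their ratios under an independence assumption, so no explicit modality weights are required:

% Hypothetical notation: s_f, s_v are face and voice match scores,
% p_c are individual-specific distributions, p_bg impostor distributions.
\[
  \Lambda(c) \;=\;
  \frac{p^{f}_{c}(s_f)}{p^{f}_{\mathrm{bg}}(s_f)}
  \cdot
  \frac{p^{v}_{c}(s_v)}{p^{v}_{\mathrm{bg}}(s_v)},
  \qquad \text{accept cluster } c \text{ iff } \Lambda(c) \ge \tau,
\]

where $s_f$ and $s_v$ are the face and voice match scores of the query against cluster $c$, $p^{f}_{c}$ and $p^{v}_{c}$ are the individual-specific score distributions learnt from the cluster, $p^{f}_{\mathrm{bg}}$ and $p^{v}_{\mathrm{bg}}$ are background (impostor) score distributions, and $\tau$ is a fixed decision threshold. Because each factor is already a calibrated ratio, the product needs no per-modality normalization.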