Sciweavers

ICMCS
2000
IEEE

Information Access using Speech, Speaker and Face Recognition

14 years 4 months ago
Information Access using Speech, Speaker and Face Recognition
We describe a scheme to combine the results of audio and face identification for multimedia indexing and retrieval. Audio analysis consists of speech and speaker recognition derived from broadcast news video clip. The video component is analyzed to identify the persons in the same video clip using face recognition. When applied individually both speaker and face recognition schemes have limitations on conditions under which they perform reasonably well. By integrating the match-score results of both audio and video analysis, we find that the two techniques can complement each other. We discuss the system architecture for such a combined system, and discuss how decision fusion is applied to disparate match-scoring systems to yield the final speaker identity.
Mahesh Viswanathan, Homayoon S. M. Beigi, Alain Tr
Added 31 Jul 2010
Updated 31 Jul 2010
Type Conference
Year 2000
Where ICMCS
Authors Mahesh Viswanathan, Homayoon S. M. Beigi, Alain Tritschler, Fereydoun Maali
Comments (0)