Audio-visual speaker diarisation is the task of estimating “who spoke when” using audio and visual cues. In this paper we propose the combination of an audio diarisation syste...
A real-time audio segmentation and indexing scheme is presented in this paper. Audio recordings are segmented and classified into basic audio types such as silence, speech, music,...
Image retrieval has commonly been attempted using non-semantic approaches. It is clear, though, that semantic retrieval is more desirable because it facilitates the user's tas...
In this paper, a new approach for retrieval from semistructured photographic collections is described. We have developed a retrieval model based on the Dempster-Shafer th...
This paper discusses the application of speech alignment, image processing, and language understanding technologies to build efficient interfaces into large digital oral history a...
Michael G. Christel, Julieanna Richardson, Howard ...