We propose to include synchrony effects, known to exist in the auditory system, to represent voiced parts of the speech signal in a robust way. The system decomposes the input sig...
Patricia A. Pelle, Claudio Estienne, Horacio Franc...
We investigate methods of segmenting, visualizing, and indexing presentation videos by both audio and visual data. The audio track is segmented by speaker, and augmented with key ...
There are still no established methods for the evaluation of browsing and exploratory search tools. In the (multimedia) information retrieval community evaluations following the C...
This paper describes an application which enables the computer-assisted generation of Dublin Core-based metadata descriptions and online digital visual summaries for videos. It is...
Multimedia browsing and retrieval systems can use dimensionality reduction methods to map from high-dimensional content-based feature distributions to low-dimensional layout space...