This paper investigates the problem of incorporating auxiliary information (e.g. pitch) for speech recognition using dynamic Bayesian networks (DBNs). Previous works usually model...
For face recognition from video streams, cues such as transcripts, subtitles or on-screen text are often available. This information could be very valuable for improving the recogni...
Psychophysical studies have shown that humans actively exploit temporal information such as contiguity of images in object recognition. We have recently developed a recognition sy...
Arnulf B. A. Graf, Christian Wallraven, Heinrich H...
This paper proposes a model-based methodology for recognizing and tracking objects in digital image sequences. Objects are represented by attributed relational graphs (or ARGs), w...
Ana Beatriz V. Graciano, Roberto Marcondes Cesar J...
Information fusion of multi-modal biometrics has attracted much attention in recent years. This paper, however, focuses on information fusion within a single modality, that is, the fac...
Bo Cao, Peng Yang, Shiguang Shan, Wen Gao, Wenchao...