A new, linguistically annotated video database for automatic sign language recognition is presented. The new RWTH-BOSTON-400 corpus, which consists of 843 sentences, several spea...
Philippe Dreuw, Carol Neidle, Vassilis Athitsos, S...
This paper describes a new method for fast speaker adaptation in large vocabulary recognition systems. As in most HMM-based recognizers, the observation densities are modeled as a...
Jacques Duchateau, Tobias Leroy, Kris Demuynck, Hu...
Remote participants in hybrid meetings often have trouble following what is going on in the (physical) meeting room they are connected to. This paper describes a videoconferenci...
Rieks op den Akker, Dennis Hofs, Hendri Hondorp, H...
Face-to-face meetings usually encompass several modalities including speech, gesture, handwriting, and person identification. Recognition and integration of each of these modalit...
Ralph Gross, Michael Bett, Hua Yu, Xiaojin Zhu, Yu...
The aim of this paper is to compare different log-likelihood scoring methods that different sites used in the latest state-of-the-art Joint Factor Analysis (JFA) Speaker Recognit...
Ondrej Glembek, Lukas Burget, Najim Dehak, Niko Br...