Neurophysiological studies in the primary auditory cortex have recently demonstrated a rich diversity of responses that provide an explicit multidimensional representation of phon...
We are interested in recovering aspects of vocal tract’s geometry and dynamics from auditory and visual speech cues. We approach the problem in a statistical framework based on ...
Athanassios Katsamanis, George Papandreou, Petros ...
State-of-the-art speaker diarization systems for meetings are now at a point where overlapped speech contributes significantly to the errors made by the system. However, little i...
Kofi Boakye, B. Trueba-Hornero, Oriol Vinyals, Ger...
This paper presents a set of corpus-based text-to-speech synthesis technologies for Mandarin Chinese. A large speech corpus produced by a single speaker is used, and the speech out...
Existing knowledge on how people use speech-based technologies in realistic settings is limited. We are conducting a longitudinal field study, spanning six months, to investigate ...
Jinjuan Feng, Shaojian Zhu, Ruimin Hu, Andrew Sear...