Our previous analysis of speaker-adaptive HMM-based speech synthesis methods suggested that there are two possible reasons why average voices can obtain higher subjective scores t...
Sandra Andraszewicz, Junichi Yamagishi, Simon King
Although research has previously been done on multilingual speech recognition, it has been found to be very difficult to improve over separately trained systems. The usual approa...
Lukas Burget, Petr Schwarz, Mohit Agarwal, Pinar A...
An essential step in the generation of expressive speech synthesis is the automatic detection and classification of emotions most likely to be present in textual input. At last I...
In this paper, we propose a novel phase preserving spectral subtraction method for enhancing speech in noise. Instead of the conventional approach of carrying out subtraction on t...
Audio segmentation is an essential preprocessing step in several audio processing applications with a significant impact e.g. on speech recognition performance. We introduce a no...