A solution for the slow convergence of most learning rules for Recurrent Neural Networks (RNN) has been proposed under the terms Liquid State Machines (LSM) and Echo State Netw...
David Verstraeten, Benjamin Schrauwen, Dirk Stroob...
Preparation, recording, segmentation and pitch labelling of Slovenian diphone inventories are described. A special user-friendly interface package was developed in order to ...
Jerneja Gros, Ivo Ipsic, Simon Dobrisek, France Mi...
Voice conversion has become increasingly important in speech technology, but most current work has to use parallel utterances from both the source and target speakers as the traini...
Traditionally, facial expression recognition (FER) issues have been studied mostly based on modalities of 2D images, 2D videos, and 3D static models. In this paper, we propose a sp...
In the present work we study the appropriateness of a number of linear and non-linear regression methods, employed on the task of speech segmentation, for combining multiple phone...