In this paper we demonstrate that Long Short-Term Memory (LSTM) is a differentiable recurrent neural net (RNN) capable of robustly categorizing timewarped speech data. We measure ...
Homograph ambiguity is an original issue in Text-to-Speech (TTS). To disambiguate homograph, several efficient approaches have been proposed such as part-of-speech (POS) n-gram, B...
This paper presents a new vision-based obstacle detection method for mobile robots. Each individual image pixel is classified as belonging either to an obstacle or the ground base...
Motionanalysis often relies on differencing operations that inherently amplify noise and are hindered by the spatial correspondenceproblem.Analternative approach is proposedusing ...
In this paper, we investigate the behavior of Gabor responses at automatically located facial feature points for face recognition. In our approach, a set of feature points on the ...