Existing large-scale articulatory databases describe the tongue shape through the 2D positions of 3–4 fixed landmarks on the tongue surface. The ability to reconstruct the full...
In an attempt to overcome problems associated with articulatory limitations and generative models, this work considers the use of phonological features in discriminative models fo...
Deep Belief Networks (DBNs) are multi-layer generative models. They can be trained to model windows of coefficients extracted from speech and they discover multiple layers of fea...
Abdel-rahman Mohamed, Tara N. Sainath, George Dahl...
In the paper we present two techniques improving the recognition accuracy of multilayer perceptron neural networks (MLP ANN) by means of adopting Speaker Adaptive Training. The us...
: We consider the matching function in vector quantization based speaker identification system. The model of a speaker is a codebook generated from the set of feature vectors from ...