Sciweavers

184 search results - page 26 / 37
» Introduction of the Speaking Rate in the Model of Speech Rec...
Sort
View
ICASSP
2009
IEEE
14 years 2 months ago
Lattice-based MLLR for speaker recognition
Maximum-Likelihod Linear Regression (MLLR) transform coefficients have shown to be useful features for text-independent speaker recognition systems. These use MLLR coefficients ...
Marc Ferras, Claude Barras, Jean-Luc Gauvain
ICASSP
2011
IEEE
12 years 11 months ago
Deep Belief Networks using discriminative features for phone recognition
Deep Belief Networks (DBNs) are multi-layer generative models. They can be trained to model windows of coefficients extracted from speech and they discover multiple layers of fea...
Abdel-rahman Mohamed, Tara N. Sainath, George Dahl...
INTERSPEECH
2010
13 years 2 months ago
Combination of probabilistic and possibilistic language models
In a previous paper we proposed Web-based language models relying on the possibility theory. These models explicitly represent the possibility of word sequences. In this paper we ...
Stanislas Oger, Vladimir Popescu, Georges Linar&eg...
ICASSP
2009
IEEE
14 years 2 months ago
Voice search of structured media data
This paper addresses the problem of using unstructured queries to search a structured database in voice search applications. By incorporating structural information in music metad...
Young-In Song, Ye-Yi Wang, Yun-Cheng Ju, Mike Selt...
MICAI
2007
Springer
14 years 1 months ago
An EM Algorithm to Learn Sequences in the Wavelet Domain
The wavelet transform has been used for feature extraction in many applications of pattern recognition. However, in general the learning algorithms are not designed taking into acc...
Diego H. Milone, Leandro E. Di Persia