Sciweavers

253 search results - page 20 / 51
» Robust Speech Recognition Using Neural Networks and Hidden M...
Sort
View
ICASSP
2008
IEEE
14 years 3 months ago
Multimodal information fusion using the iterative decoding algorithm and its application to audio-visual speech recognition
The fusion of information from heterogenous sensors is crucial to the effectiveness of a multimodal system. Noise affect the sensors of different modalities independently. A good ...
Shankar T. Shivappa, Bhaskar D. Rao, Mohan M. Triv...
TASLP
2002
96views more  TASLP 2002»
13 years 8 months ago
MAP speaker adaptation of state duration distributions for speech recognition
This paper presents a framework for maximum a posteriori (MAP) speaker adaptation of state duration distributions in hidden Markov models (HMM). Four key issues of MAP estimation, ...
Néstor Becerra Yoma, Jorge Silva Sán...
CSL
2002
Springer
13 years 8 months ago
Transformation streams and the HMM error model
The most popular model used in automatic speech recognition is the hidden Markov model (HMM). Though good performance has been obtained with such models there are well known limit...
M. J. F. Gales
ICASSP
2011
IEEE
13 years 13 days ago
Deep neural networks for acoustic emotion recognition: Raising the benchmarks
Deep Neural Networks (DNNs) denote multilayer artificial neural networks with more than one hidden layer and millions of free parameters. We propose a Generalized Discriminant An...
André Stuhlsatz, Christine Meyer, Florian E...
ICASSP
2009
IEEE
14 years 3 months ago
Speech emotion recognition via a max-margin framework incorporating a loss function based on the Watson and Tellegen's emotion m
This paper considers a method for speech emotion recognition by a max-margin framework incorporating a loss function based on a well-known model called the Watson and Tellegen’s...
Sungrack Yun, Chang D. Yoo