Sciweavers

67 search results - page 9 / 14
» Efficient Gaussian Mixture for Speech Recognition
ICASSP
2011
IEEE
13 years 9 days ago
Maximum likelihood adaptation of histogram equalization with constraint for robust speech recognition
In this paper, we propose a novel feature space adaptation technique to improve the robustness of speech recognition in noisy environments. Histogram equalization (HEQ) is an effe...
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li
SMC
2007
IEEE
14 years 2 months ago
Mental tension detection in the speech based on physiological monitoring
The focus of this paper is mental tension detection in speech, to help control tension in day-to-day business such as conferences and operations in a call center. It is di...
Michiaki Ariga, Yoshikazu Yano, Shinji Doki, Shige...
ICASSP
2011
IEEE
13 years 9 days ago
Arccosine kernels: Acoustic modeling with infinite neural networks
Neural networks are a useful alternative to Gaussian mixture models for acoustic modeling; however, training multilayer networks involves a difficult, nonconvex optimization that...
Chih-Chieh Cheng, Brian Kingsbury
CVPR
2012
IEEE
11 years 11 months ago
Robust Boltzmann Machines for recognition and denoising
While Boltzmann Machines have been successful at unsupervised learning and density modeling of images and speech data, they can be very sensitive to noise in the data. In this pap...
Yichuan Tang, Ruslan Salakhutdinov, Geoffrey E. Hi...
CLEAR
2007
Springer
14 years 2 months ago
The AIT Multimodal Person Identification System for CLEAR 2007
This paper presents the person identification system developed at Athens Information Technology and its performance in the CLEAR 2007 evaluations. The system operates on the audiov...
Andreas Stergiou, Aristodemos Pnevmatikakis, Lazar...