Sciweavers

67 search results - page 13 / 14
» Multilingual acoustic modeling for speech recognition based ...
ICASSP
2009
IEEE
14 years 2 months ago
Detecting bandlimited audio in broadcast television shows
For TV and radio shows containing narrowband speech, Speech-to-text (STT) accuracy on the narrowband audio can be improved by using an acoustic model trained on acoustically match...
Mark C. Fuhs, Qin Jin, Tanja Schultz
TCSV
2008
13 years 6 months ago
Exploring Co-Occurence Between Speech and Body Movement for Audio-Guided Video Localization
This paper presents a bottom-up approach that combines audio and video to simultaneously locate individual speakers in the video (2-D source localization) and segment their speech ...
H. Vajaria, S. Sarkar, R. Kasturi
NOLISP
2005
Springer
14 years 28 days ago
MLP Internal Representation as Discriminative Features for Improved Speaker Recognition
Feature projection by non-linear discriminant analysis (NLDA) can substantially increase classification performance. In automatic speech recognition (ASR) the projection provided b...
Dalei Wu, Andrew C. Morris, Jacques C. Koreman
ICIP
2004
IEEE
14 years 9 months ago
Statistical transformations of frontal models for non-frontal face verification
In the framework of a face verification system using local features and a Gaussian Mixture Model based classifier, we address the problem of non-frontal face verification (when on...
Conrad Sanderson, Samy Bengio
COST
2009
Springer
14 years 2 months ago
Multiple Feature Extraction and Hierarchical Classifiers for Emotions Recognition
The recognition of the emotional states of a speaker is a multidisciplinary research area that has received great interest in recent years. One of the most important goal...
Enrique M. Albornoz, Diego H. Milone, Hugo Leonard...