Sciweavers

2047 search results - page 83 / 410
» The limits of speech recognition
TASLP
2011
13 years 3 months ago
Advances in Missing Feature Techniques for Robust Large-Vocabulary Continuous Speech Recognition
Missing feature theory (MFT) has demonstrated great potential for improving the noise robustness in speech recognition. MFT was mostly applied in the log-spectral domain since ...
Maarten Van Segbroeck, Hugo Van Hamme
SPEECH
2011
13 years 3 months ago
The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate
This paper first introduces a newly-recorded high quality Romanian speech corpus designed for speech synthesis, called “RSS”, along with Romanian front-end text processing mo...
Adriana Stan, Junichi Yamagishi, Simon King, Matth...
ACII
2007
Springer
14 years 3 months ago
Frame vs. Turn-Level: Emotion Recognition from Speech Considering Static and Dynamic Processing
Opposing the predominant turn-wise statistics of acoustic Low-Level-Descriptors followed by static classification, we re-investigate dynamic modeling directly on the frame...
Bogdan Vlasenko, Björn Schuller, Andreas Wend...
ICMI
2005
Springer
14 years 2 months ago
Inferring body pose using speech content
Untethered multimodal interfaces are more attractive than tethered ones because they are more natural and expressive for interaction. Such interfaces usually require robust vision...
Sy Bor Wang, David Demirdjian
ICASSP
2009
IEEE
14 years 3 months ago
Data-driven lexicon expansion for Mandarin broadcast news and conversation speech recognition
We present a data-driven framework for expanding the lexicon to improve Mandarin broadcast news and conversation speech recognition. The lexicon expansion includes the generation ...
Xin Lei, Wen Wang, Andreas Stolcke