Sciweavers

108 search results - page 11 / 22
» High-level approaches to confidence estimation in speech rec...
Sort
View
TASLP
2008
106views more  TASLP 2008»
13 years 7 months ago
Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model
A method is described for estimating the fundamental frequencies of several concurrent sounds in polyphonic music and multiple-speaker speech signals. The method consists of a comp...
Anssi Klapuri
ICMI
2005
Springer
170views Biometrics» more  ICMI 2005»
14 years 27 days ago
Inferring body pose using speech content
Untethered multimodal interfaces are more attractive than tethered ones because they are more natural and expressive for interaction. Such interfaces usually require robust vision...
Sy Bor Wang, David Demirdjian
ICASSP
2009
IEEE
14 years 2 months ago
A criterion for the enhancement of time-frequency masks in missing data recognition
Despite their effectiveness for robust speech processing, missing data techniques are vulnerable to errors in the classification of the input speech signal’s time-frequency poi...
Daniel Pullella, Roberto Togneri
TASLP
2008
154views more  TASLP 2008»
13 years 7 months ago
Capturing Local Variability for Speaker Normalization in Speech Recognition
The new model reduces the impact of local spectral and temporal variability by estimating a finite set of spectral and temporal warping factors which are applied to speech at the f...
Antonio Miguel, Eduardo Lleida, Richard Rose, Luis...
ICASSP
2009
IEEE
14 years 2 months ago
Audio segmentation for speech recognition using segment features
Audio segmentation is an essential preprocessing step in several audio processing applications with a significant impact e.g. on speech recognition performance. We introduce a no...
David Rybach, Christian Gollan, Ralf Schlüter...