Sciweavers

334 search results - page 42 / 67
» Improving speech playback using time-compression and speech ...
Sort
View
ICMI
2007
Springer
161views Biometrics» more  ICMI 2007»
14 years 2 months ago
Detecting communication errors from visual cues during the system's conversational turn
Automatic detection of communication errors in conversational systems has been explored extensively in the speech community. However, most previous studies have used only acoustic...
Sy Bor Wang, David Demirdjian, Trevor Darrell
CSL
2000
Springer
13 years 8 months ago
Pronunciation modeling by sharing Gaussian densities across phonetic models
Conversational speech exhibits considerable pronunciation variability, which has been shown to have a detrimental effect on the accuracy of automatic speech recognition. There hav...
Murat Saraclar, Harriet J. Nock, Sanjeev Khudanpur
LREC
2008
81views Education» more  LREC 2008»
13 years 10 months ago
Speech Errors on Frequently Observed Homophones in French: Perceptual Evaluation vs Automatic Classification
The present contribution aims at increasing our understanding of automatic speech recognition (ASR) errors involving frequent homophone or almost homophone words by confronting th...
Rena Nemoto, Ioana Vasilescu, Martine Adda-Decker
ICASSP
2008
IEEE
14 years 2 months ago
Corrected tandem features for acoustic model training
This paper describes a simple method for significantly improving Tandem features used to train acoustic models for large-vocabulary speech recognition. The linear activations at ...
Arlo Faria, Nelson Morgan
ICASSP
2011
IEEE
13 years 5 days ago
Robust speaker identification using a CASA front-end
Speaker recognition remains a challenging task under noisy conditions. Inspired by auditory perception, computational auditory scene analysis (CASA) typically segregates speech by...
Xiaojia Zhao, Yang Shao, DeLiang Wang