Sciweavers

334 search results - page 53 / 67
» Improving speech playback using time-compression and speech ...
ICASSP
2011
IEEE
12 years 10 months ago
Training of error-corrective model for ASR without using audio data
This paper introduces a method to train an error-corrective model for Automatic Speech Recognition (ASR) without using audio data. In existing techniques, it is assumed that suffi...
Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura
ICIP
2000
IEEE
14 years 8 months ago
Lip Contour Extraction Using a Deformable Model
The use of visual information from lip movements can improve the accuracy and robustness of a speech recognition system. Accurate extraction of visual features associated with the...
Alan Wee-Chung Liew, Shu Hung Leung, Wing Hong Lau
ICASSP
2007
IEEE
14 years 1 months ago
Unsupervised Audio Segmentation using Extended Baum-Welch Transformations
Audio segmentation has applications in a variety of contexts, such as audio information retrieval, automatic sound analysis, and as a pre-processing step in speech recognition. Ex...
Tara N. Sainath, Dimitri Kanevsky, Giridharan Iyen...
ICASSP
2010
IEEE
13 years 7 months ago
Improved statistical models for SMT-based speaking style transformation
Automatic speech recognition (ASR) results contain not only ASR errors, but also disfluencies and colloquial expressions that must be corrected to create readable transcripts. We...
Graham Neubig, Yuya Akita, Shinsuke Mori, Tatsuya ...
COST
2009
Springer
14 years 1 months ago
Multiple Feature Extraction and Hierarchical Classifiers for Emotions Recognition
The recognition of the emotional states of a speaker is a multidisciplinary research area that has received great interest in recent years. One of the most important goal...
Enrique M. Albornoz, Diego H. Milone, Hugo Leonard...