Sciweavers

112 search results - page 11 / 23
» Epoch Extraction From Speech Signals
Sort
View
ICASSP
2011
IEEE
14 years 9 months ago
Voxel-based Viterbi Active Speaker Tracking (V-VAST) with best view selection for video lecture post-production
An automated system is presented for reducing a multi-view lecture recording into a single view video containing a best view summary of active speakers. The system uses skin color...
Damien Kelly, Anil Kokaram, Frank Boland
220
Voted
ICMI
2004
Springer
281views Biometrics» more  ICMI 2004»
15 years 11 months ago
Articulatory features for robust visual speech recognition
Visual information has been shown to improve the performance of speech recognition systems in noisy acoustic environments. However, most audio-visual speech recognizers rely on a ...
Kate Saenko, Trevor Darrell, James R. Glass
ICASSP
2010
IEEE
15 years 5 months ago
Evaluation of random-projection-based feature combination on speech recognition
Random projection has been suggested as a means of dimensionality reduction, where the original data are projected onto a subspace using a random matrix. It represents a computati...
Tetsuya Takiguchi, Jeff Bilmes, Mariko Yoshii, Yas...
108
Voted
ICASSP
2011
IEEE
14 years 9 months ago
Talker-to-listener distance effects on the variations of the intensity and the fundamental frequency of speech
In this study we focus on the relationship between the talker-tolistener distance (TLD) and the dynamics of speech intensity and fundamental frequency. A new experiment for the ex...
Thibaut Fux, Gang Feng, Veronique Zimpfer
162
Voted
ICASSP
2011
IEEE
14 years 9 months ago
Amplitude modulation spectrogram based features for robust speech recognition in noisy and reverberant environments
In this contribution we present a feature extraction method that relies on the modulation-spectral analysis of amplitude fluctuations within sub-bands of the acoustic spectrum by ...
Niko Moritz, Jörn Anemüller, Birger Koll...