Sciweavers

» Epoch Extraction From Speech Signals
ICASSP
2011
IEEE
13 years 5 days ago
Voxel-based Viterbi Active Speaker Tracking (V-VAST) with best view selection for video lecture post-production
An automated system is presented for reducing a multi-view lecture recording into a single view video containing a best view summary of active speakers. The system uses skin color...
Damien Kelly, Anil Kokaram, Frank Boland
ICMI
2004
Springer
14 years 1 months ago
Articulatory features for robust visual speech recognition
Visual information has been shown to improve the performance of speech recognition systems in noisy acoustic environments. However, most audio-visual speech recognizers rely on a ...
Kate Saenko, Trevor Darrell, James R. Glass
ICASSP
2010
IEEE
13 years 8 months ago
Evaluation of random-projection-based feature combination on speech recognition
Random projection has been suggested as a means of dimensionality reduction, where the original data are projected onto a subspace using a random matrix. It represents a computati...
Tetsuya Takiguchi, Jeff Bilmes, Mariko Yoshii, Yas...
ICASSP
2011
IEEE
13 years 5 days ago
Talker-to-listener distance effects on the variations of the intensity and the fundamental frequency of speech
In this study we focus on the relationship between the talker-to-listener distance (TLD) and the dynamics of speech intensity and fundamental frequency. A new experiment for the ex...
Thibaut Fux, Gang Feng, Veronique Zimpfer
ICASSP
2011
IEEE
13 years 5 days ago
Amplitude modulation spectrogram based features for robust speech recognition in noisy and reverberant environments
In this contribution we present a feature extraction method that relies on the modulation-spectral analysis of amplitude fluctuations within sub-bands of the acoustic spectrum by ...
Niko Moritz, Jörn Anemüller, Birger Koll...