Search Sciweavers | Sciweavers

157

Voted

ICASSP
2011
IEEE

149views Signal Processing» more ICASSP 2011»

Voxel-based Viterbi Active Speaker Tracking (V-VAST) with best view selection for video lecture post-production

14 years 9 months ago

An automated system is presented for reducing a multi-view lecture recording into a single view video containing a best view summary of active speakers. The system uses skin color...

Damien Kelly, Anil Kokaram, Frank Boland

claim paper

Read More »

224

click to vote

ICMI
2004
Springer

281views Biometrics» more ICMI 2004»

Articulatory features for robust visual speech recognition

15 years 11 months ago

Download people.csail.mit.edu

Visual information has been shown to improve the performance of speech recognition systems in noisy acoustic environments. However, most audio-visual speech recognizers rely on a ...

Kate Saenko, Trevor Darrell, James R. Glass

claim paper

Read More »

188

click to vote

ICASSP
2010
IEEE

158views Signal Processing» more ICASSP 2010»

Evaluation of random-projection-based feature combination on speech recognition

15 years 6 months ago

Download www.me.cs.scitec.kobe-u.ac.jp

Random projection has been suggested as a means of dimensionality reduction, where the original data are projected onto a subspace using a random matrix. It represents a computati...

Tetsuya Takiguchi, Jeff Bilmes, Mariko Yoshii, Yas...

claim paper

Read More »

113

click to vote

ICASSP
2011
IEEE

167views Signal Processing» more ICASSP 2011»

Talker-to-listener distance effects on the variations of the intensity and the fundamental frequency of speech

14 years 9 months ago

Download mirlab.org

In this study we focus on the relationship between the talker-tolistener distance (TLD) and the dynamics of speech intensity and fundamental frequency. A new experiment for the ex...

Thibaut Fux, Gang Feng, Veronique Zimpfer

claim paper

Read More »

165

click to vote

ICASSP
2011
IEEE

166views Signal Processing» more ICASSP 2011»

Amplitude modulation spectrogram based features for robust speech recognition in noisy and reverberant environments

14 years 9 months ago

Download mirlab.org

In this contribution we present a feature extraction method that relies on the modulation-spectral analysis of amplitude fluctuations within sub-bands of the acoustic spectrum by ...

Niko Moritz, Jörn Anemüller, Birger Koll...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers