Sciweavers

ICASSP
2009
IEEE
14 years 6 months ago
An auditory-based feature for robust speech recognition
A conventional automatic speech recognizer does not perform well in the presence of noise, while human listeners are able to segregate and recognize speech in noisy conditions. We...
Yang Shao, Zhaozhang Jin, DeLiang Wang, Soundarara...
ICASSP
2009
IEEE
14 years 6 months ago
On separating glottal source and vocal tract information in telephony speaker verification
The popular mel-frequency cepstral coefficients (MFCCs) capture a mixture of speaker-related, phonemic and channel information. Speaker-related information could be further broke...
Tomi Kinnunen, Paavo Alku
ICASSP
2009
IEEE
14 years 6 months ago
Minimum variance modulation filter for robust speech recognition
This paper describes a way of designing modulation filter by datadriven analysis which improves the performance of automatic speech recognition systems that operate in real envir...
Yu-Hsiang Bosco Chiu, Richard M. Stern
ICASSP
2009
IEEE
14 years 6 months ago
Audio segmentation for speech recognition using segment features
Audio segmentation is an essential preprocessing step in several audio processing applications with a significant impact e.g. on speech recognition performance. We introduce a no...
David Rybach, Christian Gollan, Ralf Schlüter...
ICASSP
2009
IEEE
14 years 6 months ago
Filter-and-forward distributed beamforming for relay networks in frequency selective fading channels
A half-duplex distributed beamforming technique for relay networks with frequency selective fading channels is developed. The network relays use the filter-and-forward (FF) strat...
Haihua Chen, Alex B. Gershman, Shahram Shahbazpana...
ICASSP
2009
IEEE
14 years 6 months ago
A blind speech enhancement algorithm for the suppression of late reverberation and noise
This paper proposes a new speech enhancement algorithm for the suppression of background noise and late reverberation without a priori knowledge. A generalized spectral weighting ...
Heinrich W. Löllmann, Peter Vary
ICASSP
2009
IEEE
14 years 6 months ago
COSINE - A corpus of multi-party COnversational Speech In Noisy Environments
We present an overview of the data collection and transcription efforts for the COnversational Speech In Noisy Environments (COSINE) corpus. The corpus is a set of multi-party con...
Alex Stupakov, Evan Hanusa, Jeff A. Bilmes, Dieter...
ICASSP
2009
IEEE
14 years 6 months ago
Visual grouping by neural oscillators
Distributed synchronization is known to occur at several scales in the brain, and has been suggested as playing a key functional role in perceptual grouping. State-of-the-art visu...
Guoshen Yu, Jean-Jacques E. Slotine
ICASSP
2009
IEEE
14 years 6 months ago
MIMO decoding based on stochastic reconstruction from multiple projections
Least squares (LS) fitting is one of the most fundamental techniques in science and engineering. It is used to estimate parameters from multiple noisy observations. In many probl...
Amir Leshem, Jacob Goldberger