Sciweavers

ICASSP
2011
IEEE
13 years 3 months ago
A flexible high-throughput hardware architecture for a Gaussian noise generator
In this paper a flexible, high-throughput, low-complexity additive white Gaussian noise (AWGN) channel generator is presented. The proposed generator employs a Mersenne-Twister to g...
Ioannis Paraskevakos, Vassilis Paliouras
ICASSP
2011
IEEE
13 years 3 months ago
Why word error rate is not a good metric for speech recognizer training for the speech translation task?
Speech translation (ST) is an enabling technology for cross-lingual oral communication. An ST system consists of two major components: an automatic speech recognizer (ASR) and a ma...
Xiaodong He, Li Deng, Alex Acero
ICASSP
2011
IEEE
13 years 3 months ago
Including human expertise in speaker recognition systems: report on a pilot evaluation
The 2010 NIST Speaker Recognition Evaluation (SRE10) included a test of Human Assisted Speaker Recognition (HASR) in which systems based in whole or in part on human expertise wer...
Craig S. Greenberg, Alvin F. Martin, George R. Dod...
ICASSP
2011
IEEE
13 years 3 months ago
Prediction of discrete cosine transformed coefficients in resized pixel blocks
A hybrid model was developed to predict the zero-quantized discrete cosine transform (ZQDCT) coefficients for intra blocks in our previous work. However, the complicated...
Jin Li, Weiwei Chen, Moncef Gabbouj, Jarmo Takala,...
ICASSP
2011
IEEE
13 years 3 months ago
On the ergodic capacity of jointly-correlated Rician fading MIMO channels
In this paper, we study the capacity-achieving input covariance matrices for the jointly-correlated (or the Weichselberger) Rician fading multiple-input multiple-output (MIMO) a...
Chao-Kai Wen, Shi Jin, Kai-Kit Wong, Jung-Chieh Ch...
ICASSP
2011
IEEE
13 years 3 months ago
Data-driven fMRI group classification using connected components and Gaussian process classifiers
Functional magnetic resonance imaging (fMRI) is a popular tool for studying brain activity due to its non-invasiveness. Conventionally an expected response needs to be available f...
Sarah Lee, Fernando Zelaya, Yohan Samarasinghe, St...
ICASSP
2011
IEEE
13 years 3 months ago
Acoustic data sharing for Afghan and Persian languages
In this work, we compare several known approaches for multilingual acoustic modeling for three languages, Dari, Farsi and Pashto, which are of recent geo-political interest. We de...
Arindam Mandal, Dimitra Vergyri, Murat Akbacak, Co...
ICASSP
2011
IEEE
13 years 3 months ago
Prosodic control of unit-selection speech synthesis: A probabilistic approach
One problem in concatenative speech synthesis is how to incorporate prosodic factors in the unit selection. Imposing a predicted prosodic target is error-prone and does not benefi...
Christophe Veaux, Xavier Rodet
ICASSP
2011
IEEE
13 years 3 months ago
Dynamic selection of a speech enhancement method for robust speech recognition in moving motorcycle environment
We present a speech pre-processing scheme (SPPS) for robust speech recognition in the moving motorcycle environment. The SPPS is dynamically adapted during the run-time operation ...
Iosif Mporas, Todor Ganchev, Otilia Kocsis, Nikos ...
ICASSP
2011
IEEE
13 years 3 months ago
An acoustically-motivated spatial prior for under-determined reverberant source separation
We consider the task of under-determined reverberant audio source separation. We model the contribution of each source to all mixture channels in the time-frequency domain as a ze...
Ngoc Q. K. Duong, Emmanuel Vincent, Rémi Gr...