Sciweavers

ICASSP
2011
IEEE
13 years 3 months ago
Deep neural networks for acoustic emotion recognition: Raising the benchmarks
Deep Neural Networks (DNNs) denote multilayer artificial neural networks with more than one hidden layer and millions of free parameters. We propose a Generalized Discriminant An...
André Stuhlsatz, Christine Meyer, Florian E...
ICASSP
2011
IEEE
13 years 3 months ago
Optimal and low-complexity iterative joint network/channel decoding for the multiple-access relay channel
In this paper, we investigate joint network and channel decoding algorithms for the multiple-access relay channel. We consider a realistic reference scenario with Rayleigh fading ...
Xuan-Thang Vu, Marco Di Renzo, Pierre Duhamel
ICASSP
2011
IEEE
13 years 3 months ago
A perceptually transparent audio power reduction algorithm for loudspeaker power management
Power management in loudspeakers is used to extend the battery life of portable electronics and to prevent thermal damage to the speakers. Traditionally, speaker power management ...
Leung Kin Chiu, Nathan V. Parrish, David V. Anders...
ICASSP
2011
IEEE
13 years 3 months ago
A flexible high-throughput hardware architecture for a gaussian noise generator
In this paper a exible, high-throughput, low-complexity additive white gaussian noise (AWGN) channel generator is presented. The proposed generator employs a Mersenne-Twister to g...
Ioannis Paraskevakos, Vassilis Paliouras
ICASSP
2011
IEEE
13 years 3 months ago
Why word error rate is not a good metric for speech recognizer training for the speech translation task?
Speech translation (ST) is an enabling technology for cross-lingual oral communication. A ST system consists of two major components: an automatic speech recognizer (ASR) and a ma...
Xiaodong He, Li Deng, Alex Acero
ICASSP
2011
IEEE
13 years 3 months ago
Including human expertise in speaker recognition systems: report on a pilot evaluation
The 2010 NIST Speaker Recognition Evaluation (SRE10) included a test of Human Assisted Speaker Recognition (HASR) in which systems based in whole or in part on human expertise wer...
Craig S. Greenberg, Alvin F. Martin, George R. Dod...
ICASSP
2011
IEEE
13 years 3 months ago
Prediction of discrete cosine transformed coefficients in resized pixel blocks
Abstract— A hybrid model was developed to predict the zeroquantized discrete cosine transform (ZQDCT) coefficients for intra blocks in our previous work. However, the complicated...
Jin Li, Weiwei Chen, Moncef Gabbouj, Jarmo Takala,...
ICASSP
2011
IEEE
13 years 3 months ago
On the ergodic capacity of jointly-correlated rician Fading MIMO channels
—In this paper, we study the capacity-achieving input covariance matrices for the jointly-correlated (or the Weichselberger) Rician fading multiple-input multiple-output (MIMO) a...
Chao-Kai Wen, Shi Jin, Kai-Kit Wong, Jung-Chieh Ch...
ICASSP
2011
IEEE
13 years 3 months ago
Data-driven fMRI group classification using connected components and Gaussian process classifiers
Functional magnetic resonance imaging (fMRI) is a popular tool for studying brain activity due to its non-invasiveness. Conventionally an expected response needs to be available f...
Sarah Lee, Fernando Zelaya, Yohan Samarasinghe, St...
ICASSP
2011
IEEE
13 years 3 months ago
Acoustic data sharing for Afghan and Persian languages
In this work, we compare several known approaches for multilingual acoustic modeling for three languages, Dari, Farsi and Pashto, which are of recent geo-political interest. We de...
Arindam Mandal, Dimitra Vergyri, Murat Akbacak, Co...