Sciweavers

ICASSP
2011
IEEE
12 years 11 months ago
Time domain reconstruction of spatial sound fields using compressed sensing
A novel technique for time domain spatial sound reproduction using compressed sensing is presented. The presented technique is based on the application of compressed sensing theor...
Andrew Wabnitz, Nicolas Epain, André van Sc...
ICASSP
2011
IEEE
12 years 11 months ago
Degenerate Unmixing Estimation Technique using the Constant Q Transform
The Degenerate Unmixing Estimation Technique (DUET) is a Blind Source Separation (BSS) algorithm for stereo audio. DUET depends on an amplitude-phase 2d histogram built from the d...
Zafar Rafii, Bryan Pardo
ICASSP
2011
IEEE
12 years 11 months ago
Exposing duplicated regions affected by reflection, rotation and scaling
A commonly considered image manipulation is to conceal undesirable objects or people in the scene with a region of pixels copied from the same image. Forensic mechanisms aimed at ...
Sergio Bravo-Solorio, Asoke K. Nandi
ICASSP
2011
IEEE
12 years 11 months ago
Game-theoretic resource allocation in relay-assisted DS/CDMA systems with successive interference cancellation
The problem of non-cooperative resource allocation in an amplifyand-forward relay-assisted DS/CDMA system is addressed. The relay designs its amplify-and-forward matrix for achiev...
Alessio Zappone, Eduard A. Jorswieck
ICASSP
2011
IEEE
12 years 11 months ago
Using morpheme and syllable based sub-words for polish LVCSR
Polish is a synthetic language with a high morpheme-perword ratio. It makes use of a high degree of inflection leading to high out-of-vocabulary (OOV) rates, and high Language Mo...
M. Ali Basha Shaik, Amr El-Desoky Mousa, Ralf Schl...
ICASSP
2011
IEEE
12 years 11 months ago
Unsupervised determination of efficient Korean LVCSR units using a Bayesian Dirichlet process model
Korean is an agglutinative language that does not have explicit word boundaries. It is also a highly inflective language that exhibits severe coarticulation effects. These charac...
Sakriani Sakti, Andrew M. Finch, Ryosuke Isotani, ...
ICASSP
2011
IEEE
12 years 11 months ago
Learning non-parametric models of pronunciation
As more data becomes available for a given speech recognition task, the natural way to improve recognition accuracy is to train larger models. But, while this strategy yields mode...
Brian Hutchinson, Jasha Droppo
ICASSP
2011
IEEE
12 years 11 months ago
Sparse variable reduced rank regression via Stiefel optimization
Reduced rank regression (RRR) has found application in various fields of signal processing. In this paper we propose a novel extension of the RRR model which we call sparse varia...
Magnus O. Ulfarsson, Victor Solo
ICASSP
2011
IEEE
12 years 11 months ago
Application specific loss minimization using gradient boosting
Gradient boosting is a flexible machine learning technique that produces accurate predictions by combining many weak learners. In this work, we investigate its use in two applica...
Bin Zhang, Abhinav Sethy, Tara N. Sainath, Bhuvana...
ICASSP
2011
IEEE
12 years 11 months ago
Analog joint source-channel coding in Rayleigh fading channels
We consider discrete-time all-analog-processing joint sourcechannel coding, using non-linear spiral-like curves. We assume a Rayleigh channel, where the receiver may employ or not...
Glauber Gomes de Oliveira Brante, Richard Demo Sou...