Mismatch in speech bandwidth between training and real operation greatly degrades the performance of automatic speech recognition (ASR) systems. Missing feature technique (MFT) is...
This paper presents a new acoustic-to-articulatory inversion methodbased on an episodic memory, which is an interesting model for two reasons. First, it does not rely on any assum...
Attention-Deficit Hyperactivity Disorder (ADHD) is the most common mental health problem in childhood and adolescence. Its diagnosis is commonly performed in a subjective manner ...
Diego Martin, Pablo Casaseca, Susana Alberola, Jos...
We describe a time-domain procedure for designing the synthesis filters of perfect-reconstruction oversampled filter banks. A condition matrix is derived from the basic magnitud...
In this paper, an approach for polyphonic music transcription based on joint multiple-F0 estimation and note onset/offset detection is proposed. For preprocessing, the resonator t...
In this paper, we propose a novel method for improving the visibility of an image (with fog or haze), as well as the image’s details. The proposed method adjusts the global cont...
Dongin Shin, Kristofor B. Gibson, Wonha Kim, Truon...
Recently, robust transmit beamforming has drawn considerable attention because it can provide guaranteed receiver performance in the presence of channel state information (CSI) er...
In this paper, we present a model for unsupervised pattern discovery using non-negative matrix factorization (NMF) with graph regularization. Though the regularization can be appl...
This paper presents a new algorithm based on shift-invariant probabilistic latent component analysis that analyzes harmonic structures in an audio signal. Each note in a constant-...
This paper presents a family of log-spectral amplitude (LSA) estimators for speech enhancement. Generalized Gamma distributed (GGD) priors are assumed for speech short-time spectr...