—Source localization and enhancement are often treated separately in the array processing literature. One can apply Steered Response Power (SRP) localization to determine the sou...
Johannes Traa, David Wingate, Noah D. Stein, Paris...
—In this paper, we propose the regression-based packet loss concealment (PLC) for digital speech transmission by using deep neural networks (DNNs) with a multiple-layer deep arch...
—Differential beamforming is one of the most popular beamforming approaches, which has the great potential to form frequency-invariant directivity patterns. In this paper, we stu...
—We propose a unified approach to automatic foreign accent recognition. It takes advantage of recent technology advances in both linguistics and acoustics based modeling techniq...
—Voice activity detection (VAD) is an important topic in audio signal processing. Contextual information is important for improving the performance of VAD at low signal-to-noise ...
—We propose an audio fingerprinting method that adapts findings from the field of blind astrometry to define simple, efficiently representable characteristic feature combina...
Abstract—This paper presents a parametric Bayesian approach to the statistical analysis of phoneme confusion matrices measured for groups of individual listeners in one or more t...
Leijon Leijon, Gustav Eje Henter, Martin Dahlquist
—We propose a fast speech analysis method which simultaneously performs high-resolution voiced/unvoiced detection (VUD) and accurate estimation of glottal closure and glottal ope...
Andreas I. Koutrouvelis, George P. Kafentzis, Niko...
—Speech separation systems usually operate on the short-time Fourier transform (STFT) of noisy speech, and enhance only the magnitude spectrum while leaving the phase spectrum un...
—Unseen noise estimation is a key yet challenging step to make a speech enhancement algorithm work in adverse environments. At worst, the only prior knowledge we know about the e...
Meng Sun, Xiongwei Zhang, Hugo Van hamme, Thomas F...