Sciweavers

TASLP
2016
8 years 8 months ago
Robust Source Localization and Enhancement With a Probabilistic Steered Response Power Model
—Source localization and enhancement are often treated separately in the array processing literature. One can apply Steered Response Power (SRP) localization to determine the sou...
Johannes Traa, David Wingate, Noah D. Stein, Paris...
TASLP
2016
8 years 8 months ago
Packet Loss Concealment Based on Deep Neural Networks for Digital Speech Transmission
—In this paper, we propose the regression-based packet loss concealment (PLC) for digital speech transmission by using deep neural networks (DNNs) with a multiple-layer deep arch...
Bong-Ki Lee, Joon-Hyuk Chang
TASLP
2016
8 years 8 months ago
Design of Directivity Patterns with a Unique Null of Maximum Multiplicity
—Differential beamforming is one of the most popular beamforming approaches, which has the great potential to form frequency-invariant directivity patterns. In this paper, we stu...
Chao Pan, Jacob Benesty, Jingdong Chen
TASLP
2016
8 years 8 months ago
i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition
—We propose a unified approach to automatic foreign accent recognition. It takes advantage of recent technology advances in both linguistics and acoustics based modeling techniq...
Hamid Behravan, Ville Hautamäki, Sabato Marco...
TASLP
2016
8 years 8 months ago
Boosting Contextual Information for Deep Neural Network Based Voice Activity Detection
—Voice activity detection (VAD) is an important topic in audio signal processing. Contextual information is important for improving the performance of VAD at low signal-to-noise ...
Xiao-Lei Zhang, DeLiang Wang
TASLP
2016
8 years 8 months ago
Robust Quad-Based Audio Fingerprinting
—We propose an audio fingerprinting method that adapts findings from the field of blind astrometry to define simple, efficiently representable characteristic feature combina...
Reinhard Sonnleitner, Gerhard Widmer
TASLP
2016
8 years 8 months ago
Bayesian Analysis of Phoneme Confusion Matrices
Abstract—This paper presents a parametric Bayesian approach to the statistical analysis of phoneme confusion matrices measured for groups of individual listeners in one or more t...
Leijon Leijon, Gustav Eje Henter, Martin Dahlquist
TASLP
2016
8 years 8 months ago
A Fast Method for High-Resolution Voiced/Unvoiced Detection and Glottal Closure/Opening Instant Estimation of Speech
—We propose a fast speech analysis method which simultaneously performs high-resolution voiced/unvoiced detection (VUD) and accurate estimation of glottal closure and glottal ope...
Andreas I. Koutrouvelis, George P. Kafentzis, Niko...
TASLP
2016
8 years 8 months ago
Complex Ratio Masking for Monaural Speech Separation
—Speech separation systems usually operate on the short-time Fourier transform (STFT) of noisy speech, and enhance only the magnitude spectrum while leaving the phase spectrum un...
Donald S. Williamson, Yuxuan Wang, DeLiang Wang
TASLP
2016
8 years 8 months ago
Unseen Noise Estimation Using Separable Deep Auto Encoder for Speech Enhancement
—Unseen noise estimation is a key yet challenging step to make a speech enhancement algorithm work in adverse environments. At worst, the only prior knowledge we know about the e...
Meng Sun, Xiongwei Zhang, Hugo Van hamme, Thomas F...