This paper presents a novel method of enhancing esophageal speech using statistical voice conversion. Esophageal speech is one of the alternative speaking methods for laryngectome...
Modern approaches to speaker recognition (verification) operate in a space of “supervectors” created via concatenation of the mean vectors of a Gaussian mixture model (GMM) a...
Balaji Vasan Srinivasan, Dmitry N. Zotkin, Ramani ...
Abstract. This paper presents a new algorithm for solving the permutation ambiguity in convolutive blind source separation. When transformed to the frequency domain, the source sep...
Abstract. The underdetermined blind audio source separation problem is often addressed in the time-frequency domain by assuming that each time-frequency point is an independently d...
In a previous work, we developed a quasi-efficient maximum likelihood approach for blindly separating stationary, temporally correlated sources modeled by Markov processes. In this...