The objective of this work is to develop and study the robustness of the zero frequency resonator (ZFR) based method for extraction of the fundamental frequency (F0) of speech sig...
Bayya Yegnanarayana, S. R. Mahadeva Prasanna, S. G...
One of the difficulties in second language (L2) learning is the weakness in discriminating between acoustic diversity within an L2 phoneme category and between different categori...
Distributed microphone systems in cars usually provide dedicated microphones for several speakers where each microphone captures the desired speech signal at the best. The signal ...
This paper proposes a feature extraction for speaker characterization by exploring the relationship between the two distinct components of the speech signal, one is harmonics acco...
Yanhua Long, Zhi-Jie Yan, Frank K. Soong, Li-Rong ...
Statistical methods for voice conversion are usually based on a single model selected in order to represent a tradeoff between goodness of fit and complexity. In this paper we ass...
In this paper, we compare and combine different approaches for instrumentally predicting the perceived quality of Text-to-Speech systems. First, a log-likelihood is determined by ...
This paper presents preliminary work on building a system able to synthesize concurrently the speech signal and a 3D animation of the speaker's face. This is done by concaten...
Human activity recognition and speech recognition appear to be two loosely related research areas. However, on a careful thought, there are several analogies between activity and ...
We investigate the use of chaotic-type features for recorded speech steganalysis. Considering that data hiding within a speech signal distorts the chaotic properties of the origina...
Full duplex hands-free man/machine interface often suffers from directional non-stationary interference (such as a competing speaker or an echo signal) as well as a stationary int...