Sciweavers

ICASSP
2011
IEEE
13 years 4 months ago
Study of robustness of zero frequency resonator method for extraction of fundamental frequency
The objective of this work is to develop and study the robustness of the zero frequency resonator (ZFR) based method for extraction of the fundamental frequency (F0) of speech sig...
Bayya Yegnanarayana, S. R. Mahadeva Prasanna, S. G...
ICASSP
2011
IEEE
13 years 4 months ago
Perceptual differentiation modeling explains phoneme mispronunciation by non-native speakers
One of the difficulties in second language (L2) learning is the weakness in discriminating between acoustic diversity within an L2 phoneme category and between different categori...
Christos Koniaris, Olov Engwall
ICASSP
2011
IEEE
13 years 4 months ago
Dynamic signal combining for distributed microphone systems in car environments
Distributed microphone systems in cars usually provide dedicated microphones for several speakers where each microphone captures the desired speech signal at the best. The signal ...
Timo Matheja, Markus Buck, Achim Eichentopf
ICASSP
2011
IEEE
13 years 4 months ago
Speaker characterization using spectral subband energy ratio based on Harmonic plus Noise Model
This paper proposes a feature extraction for speaker characterization by exploring the relationship between the two distinct components of the speech signal, one is harmonics acco...
Yanhua Long, Zhi-Jie Yan, Frank K. Soong, Li-Rong ...
INTERSPEECH
2010
13 years 7 months ago
Dynamic model selection for spectral voice conversion
Statistical methods for voice conversion are usually based on a single model selected in order to represent a tradeoff between goodness of fit and complexity. In this paper we ass...
Pierre Lanchantin, Xavier Rodet
INTERSPEECH
2010
13 years 7 months ago
Comparison of approaches for instrumentally predicting the quality of text-to-speech systems
In this paper, we compare and combine different approaches for instrumentally predicting the perceived quality of Text-to-Speech systems. First, a log-likelihood is determined by ...
Sebastian Möller, Florian Hinterleitner, Tiag...
INTERSPEECH
2010
13 years 7 months ago
Setup for acoustic-visual speech synthesis by concatenating bimodal units
This paper presents preliminary work on building a system able to synthesize concurrently the speech signal and a 3D animation of the speaker's face. This is done by concaten...
Asterios Toutios, Utpala Musti, Slim Ouni, Vincent...
CVPR
2011
IEEE
13 years 8 months ago
Modeling Human Activities as Speech
Human activity recognition and speech recognition appear to be two loosely related research areas. However, on a careful thought, there are several analogies between activity and ...
Chia-Chih Chen, Jake Aggarwal
TIFS
2008
159views more  TIFS 2008»
14 years 12 days ago
Chaotic-Type Features for Speech Steganalysis
We investigate the use of chaotic-type features for recorded speech steganalysis. Considering that data hiding within a speech signal distorts the chaotic properties of the origina...
Osman Hilmi Kocal, Emrah Yürüklü, I...
TASLP
2008
71views more  TASLP 2008»
14 years 12 days ago
Dual-Source Transfer-Function Generalized Sidelobe Canceller
Full duplex hands-free man/machine interface often suffers from directional non-stationary interference (such as a competing speaker or an echo signal) as well as a stationary int...
Gal Reuven, Sharon Gannot, Israel Cohen