Sciweavers

INTERSPEECH
2010
13 years 2 months ago
On the interdependencies between voice quality, glottal gaps, and voice-source related acoustic measures
In human speech production, the voice source contains important non-lexical information, especially relating to a speaker's voice quality. In this study, direct measurements ...
Yen-Liang Shue, Gang Chen, Abeer Alwan
INTERSPEECH
2010
13 years 2 months ago
Speech synthesis by modeling harmonics structure with multiple function
In this paper, we present a new approach to speech synthesis, in which speech utterances are synthesized using the parameters of a spectro-modeling function (Multiple function)...
Toru Nakashika, Ryuki Tachibana, Masafumi Nishimur...
INTERSPEECH
2010
13 years 2 months ago
An exploration of voice source correlates of focus
This pilot study explores how the voice source parameters vary in focally accented syllables. It examines the dynamics of the voice source parameters in an all-voiced short declar...
Irena Yanushevskaya, Christer Gobl, John Kane, Ail...
INTERSPEECH
2010
13 years 2 months ago
Augmentation of adaptation data
Linear regression based speaker adaptation approaches can improve Automatic Speech Recognition (ASR) accuracy significantly for a target speaker. However, when the available adapt...
Ravichander Vipperla, Steve Renals, Joe Frankel
INTERSPEECH
2010
13 years 2 months ago
Within and across sentence boundary language model
In this paper, we propose two language modeling approaches, namely skip trigram and across sentence boundary, to capture long-range dependencies. The skip trigram mo...
Saeedeh Momtazi, Friedrich Faubel, Dietrich Klakow
INTERSPEECH
2010
13 years 2 months ago
Quality-based playout buffering with FEC for conversational VoIP
In Voice-over-IP, buffer delay and packet loss are two main factors affecting perceived conversational quality. A quality-based algorithm aims to seek an optimum balancing of dela...
Qipeng Gong, Peter Kabal
INTERSPEECH
2010
13 years 2 months ago
Prosodic timing analysis for articulatory re-synthesis using a bank of resonators with an adaptive oscillator
A method for the analysis of prosodic-level temporal structure is introduced. The method is based on measured phase angles of an oscillator as that oscillator is made to synchroni...
Michael C. Brady
INTERSPEECH
2010
13 years 2 months ago
On the relation of Bayes risk, word error, and word posteriors in ASR
In automatic speech recognition, we are faced with a well-known inconsistency: Bayes decision rule is usually used to minimize sentence (word sequence) error, whereas in practice w...
Ralf Schlüter, Markus Nußbaum-Thom, Her...
INTERSPEECH
2010
13 years 2 months ago
Towards a robust face recognition system using compressive sensing
An application of compressive sensing (CS) theory in image-based robust face recognition is considered. Most contemporary face recognition systems suffer from limited abilities to ...
Allen Y. Yang, Zihan Zhou, Yi Ma, Shankar Sastry