Sciweavers

INTERSPEECH
2010
13 years 6 months ago
A comparative study of constrained and unconstrained approaches for segmentation of speech signal
In this work, we compare different approaches for speech segmentation, of which some are constrained and the remaining are unconstrained by phone transcript. A high accuracy speec...
Venkatesh Keri, Kishore Prahallad
INTERSPEECH
2010
13 years 6 months ago
Modeling pronunciation variation with context-dependent articulatory feature decision trees
We consider the problem of predicting the surface pronunciations of a word in conversational speech, using a model of pronunciation variation based on articulatory features. We bu...
Sam Bowman, Karen Livescu
INTERSPEECH
2010
13 years 6 months ago
Using dependency parsing and machine learning for factoid question answering on spoken documents
This paper presents our experiments in question answering for speech corpora. These experiments focus on improving the answer extraction step of the QA process. We present two app...
Pere Comas, Jordi Turmo, Lluís Màrqu...
INTERSPEECH
2010
13 years 6 months ago
The relation between pitch perception preference and emotion identification
In our study, we explore the effect of synthetic vs analytic listening mode on the identification of emotions. Numerous psychoacoustic studies have shown that listeners differ in ...
Marie Nilsenová, Martijn Goudbeek, Luuk Kem...
INTERSPEECH
2010
13 years 6 months ago
Rhythm and formant features for automatic alcohol detection
Two speech feature sets, RMS rhythmicity and formant frequencies F1-F4, are analyzed for their ability to distinguish alcoholized from sober speech. We describe the statistical fr...
Florian Schiel, Christian Heinrich, Veronika Neume...
INTERSPEECH
2010
13 years 6 months ago
Cross-lingual talker discrimination
This paper describes a talker discrimination experiment in which native English listeners were presented with two sentences spoken by bilingual talkers (English/German and English...
Mirjam Wester
INTERSPEECH
2010
13 years 6 months ago
On the potential of glottal signatures for speaker recognition
Most of current speaker recognition systems are based on features extracted from the magnitude spectrum of speech. However the excitation signal produced by the glottis is expecte...
Thomas Drugman, Thierry Dutoit
INTERSPEECH
2010
13 years 6 months ago
Semi-supervised training of Gaussian mixture models by conditional entropy minimization
In this paper, we propose a new semi-supervised training method for Gaussian Mixture Models. We add a conditional entropy minimizer to the maximum mutual information criteria, whi...
Jui-Ting Huang, Mark Hasegawa-Johnson
INTERSPEECH
2010
13 years 6 months ago
Boosted mixture learning of Gaussian mixture HMMs for speech recognition
In this paper, we propose a novel boosted mixture learning (BML) framework for Gaussian mixture HMMs in speech recognition. BML is an incremental method to learn mixture models fo...
Jun Du, Yu Hu, Hui Jiang
INTERSPEECH
2010
13 years 6 months ago
Beyond sentence prosody
The prosody of a sentence (utterance) when it appears in a discourse context differs substantially from when it is uttered in isolation. This paper addresses why paragraph is a di...
Chiu-yu Tseng