Sciweavers

INTERSPEECH
2010
A rule-based backchannel prediction model using pitch and pause information
We manually designed rules for a backchannel (BC) prediction model based on pitch and pause information. In short, the model predicts a BC when there is a pause of a certain lengt...
Khiet P. Truong, Ronald Poppe, Dirk Heylen
INTERSPEECH
2010
The INTERSPEECH 2010 paralinguistic challenge
Most paralinguistic analysis tasks are lacking agreed-upon evaluation procedures and comparability, in contrast to more 'traditional' disciplines in speech analysis. The INTE...
Björn Schuller, Stefan Steidl, Anton Batliner...
INTERSPEECH
2010
Combining five acoustic level modeling methods for automatic speaker age and gender recognition
This paper presents a novel automatic speaker age and gender identification approach which combines five different methods at the acoustic level to improve the baseline performanc...
Ming Li, Chi-Sang Jung, Kyu Jeong Han
INTERSPEECH
2010
Comparison of HMM and TMDN methods for lip synchronisation
This paper presents a comparison between a hidden Markov model (HMM) based method and a novel artificial neural network (ANN) based method for lip synchronisation. Both model type...
Gregor Hofer, Korin Richmond
INTERSPEECH
2010
Paraphrase generation to improve text-to-speech synthesis
Text-to-speech synthesizer systems are of overall good quality, especially when adapted to a specific task. Given this task and an adapted voice corpus, the message quality is mai...
Ghislain Putois, Jonathan Chevelu, Cédric B...
INTERSPEECH
2010
Improvements to the equal-parameter BIC for speaker diarization
This paper discusses a set of modifications regarding the use of the Bayesian Information Criterion (BIC) for the speaker diarization task. We focus on the specific variant of the...
Themos Stafylakis, Xavier Anguera
INTERSPEECH
2010
AuToBI - a tool for automatic ToBI annotation
This paper describes the AuToBI tool for automatic generation of hypothesized ToBI labels. While research on automatic prosodic annotation has been conducted for many years, AuToB...
Andrew Rosenberg
INTERSPEECH
2010
Rapid bootstrapping of five Eastern European languages using the Rapid Language Adaptation Toolkit
This paper presents our latest efforts toward LVCSR systems for five Eastern European languages, namely Bulgarian, Croatian, Czech, Polish, and Russian, using our Rapid Language Ad...
Ngoc Thang Vu, Tim Schlippe, Franziska Kraus, Tanj...
INTERSPEECH
2010
Robust automatic speech recognition with decoder oriented ideal binary mask estimation
In this paper, we propose a joint optimal method for automatic speech recognition (ASR) and ideal binary mask (IBM) estimation in transformed into the cepstral domain through a ne...
Lae-Hoon Kim, Kyung-Tae Kim, Mark Hasegawa-Johnson
INTERSPEECH
2010
Predicting human perception and ASR classification of word-final [t] by its acoustic sub-segmental properties
This paper presents a study on the acoustic sub-segmental properties of word-final /t/ in conversational standard Dutch and how these properties contribute to whether humans and a...
Barbara Schuppler, Mirjam Ernestus, Wim A. van Dom...