Signal Processing | Sciweavers

INTERSPEECH
2010

A rule-based backchannel prediction model using pitch and pause information

15 years 24 days ago

We manually designed rules for a backchannel (BC) prediction model based on pitch and pause information. In short, the model predicts a BC when there is a pause of a certain lengt...

Khiet P. Truong, Ronald Poppe, Dirk Heylen

INTERSPEECH
2010

The INTERSPEECH 2010 paralinguistic challenge

15 years 24 days ago

Most paralinguistic analysis tasks are lacking agreed-upon evaluation procedures and comparability, in contrast to more 'traditional' disciplines in speech analysis. The INTE...

Björn Schuller, Stefan Steidl, Anton Batliner...

INTERSPEECH
2010

Combining five acoustic level modeling methods for automatic speaker age and gender recognition

15 years 24 days ago

This paper presents a novel automatic speaker age and gender identification approach which combines five different methods at the acoustic level to improve the baseline performanc...

Ming Li, Chi-Sang Jung, Kyu Jeong Han

INTERSPEECH
2010

Comparison of HMM and TMDN methods for lip synchronisation

15 years 24 days ago

This paper presents a comparison between a hidden Markov model (HMM) based method and a novel artificial neural network (ANN) based method for lip synchronisation. Both model type...

Gregor Hofer, Korin Richmond

INTERSPEECH
2010

Paraphrase generation to improve text-to-speech synthesis

15 years 24 days ago

Text-to-speech synthesizer systems are of overall good quality, especially when adapted to a specific task. Given this task and an adapted voice corpus, the message quality is mai...

Ghislain Putois, Jonathan Chevelu, Cédric B...

INTERSPEECH
2010

Improvements to the equal-parameter BIC for speaker diarization

15 years 24 days ago

This paper discusses a set of modifications regarding the use of the Bayesian Information Criterion (BIC) for the speaker diarization task. We focus on the specific variant of the...

Themos Stafylakis, Xavier Anguera

INTERSPEECH
2010

AuToBI - a tool for automatic ToBI annotation

15 years 24 days ago

This paper describes the AuToBI tool for automatic generation of hypothesized ToBI labels. While research on automatic prosodic annotation has been conducted for many years, AuToB...

Andrew Rosenberg

INTERSPEECH
2010

Rapid bootstrapping of five Eastern European languages using the Rapid Language Adaptation Toolkit

15 years 24 days ago

This paper presents our latest efforts toward LVCSR systems for five Eastern European languages such as Bulgarian, Croatian, Czech, Polish, and Russian using our Rapid Language Ad...

Ngoc Thang Vu, Tim Schlippe, Franziska Kraus, Tanj...

INTERSPEECH
2010

Robust automatic speech recognition with decoder oriented ideal binary mask estimation

15 years 24 days ago

In this paper, we propose a joint optimal method for automatic speech recognition (ASR) and ideal binary mask (IBM) estimation in transformed into the cepstral domain through a ne...

Lae-Hoon Kim, Kyung-Tae Kim, Mark Hasegawa-Johnson

INTERSPEECH
2010

Predicting human perception and ASR classification of word-final [t] by its acoustic sub-segmental properties

15 years 24 days ago

This paper presents a study on the acoustic sub-segmental properties of word-final /t/ in conversational standard Dutch and how these properties contribute to whether humans and a...

Barbara Schuppler, Mirjam Ernestus, Wim A. van Dom...
