INTERSPEECH
2010
13 years 6 months ago
Invariant integration features combined with speaker-adaptation methods
Speaker-normalization and -adaptation methods are essential components of state-of-the-art speech recognition systems nowadays. Recently, so-called invariant integration features ...
Florian Müller, Alfred Mertins
INTERSPEECH
2010
13 years 6 months ago
Bayesian speaker recognition using Gaussian mixture model and Laplace approximation
This paper presents a Bayesian approach for Gaussian mixture model (GMM)-based speaker identification. Some approaches evaluate the speaker score of a test speech utterance using ...
Shih-Sian Cheng, I-Fan Chen, Hsin-Min Wang
INTERSPEECH
2010
13 years 6 months ago
Context adaptive training with factorized decision trees for HMM-based speech synthesis
To achieve natural high quality synthesised speech in HMM-based speech synthesis, the effective modelling of complex acoustic and linguistic contexts is critical. Traditional appro...
Kai Yu, Heiga Zen, François Mairesse, Steve...
INTERSPEECH
2010
13 years 6 months ago
Extractive summarization using a latent variable model
Extractive multi-document summarization is the task of choosing sentences from a set of documents to compose a summary text in response to a user query. We propose a generative ap...
Asli Çelikyilmaz, Dilek Hakkani-Tür
INTERSPEECH
2010
13 years 6 months ago
Language model cross adaptation for LVCSR system combination
State-of-the-art large vocabulary continuous speech recognition (LVCSR) systems often combine outputs from multiple subsystems developed at different sites. Cross system adaptatio...
Xunying Liu, Mark J. F. Gales, Philip C. Woodland
INTERSPEECH
2010
13 years 6 months ago
Automatic speech recognition for assistive writing in speech supplemented word prediction
This paper describes a system for assistive writing, the Speech Supplemented Word Prediction Program (SSWPP). This system uses the first letter of a word typed by the user as well...
John-Paul Hosom, Tom Jakobs, Allen Baker, Susan Fa...
INTERSPEECH
2010
13 years 6 months ago
Unsupervised discovery and training of maximally dissimilar cluster models
One of the difficult problems of acoustic modeling for Automatic Speech Recognition (ASR) is how to adequately model the wide variety of acoustic conditions which may be present i...
Françoise Beaufays, Vincent Vanhoucke, Bria...
INTERSPEECH
2010
13 years 6 months ago
Excitation modeling based on waveform interpolation for HMM-based speech synthesis
It is generally known that a well-designed excitation produces high quality signals in hidden Markov model (HMM)-based speech synthesis systems. This paper proposes a novel techni...
June Sig Sung, Doo Hwa Hong, Kyung Hwan Oh, Nam So...
INTERSPEECH
2010
13 years 6 months ago
Synthesis of fast speech with interpolation of adapted HSMMs and its evaluation by blind and sighted listeners
In this paper we evaluate a method for generating synthetic speech at high speaking rates based on the interpolation of hidden semi-Markov models (HSMMs) trained on speech data re...
Michael Pucher, Dietmar Schabus, Junichi Yamagishi
INTERSPEECH
2010
13 years 6 months ago
Incremental word learning using large-margin discriminative training and variance floor estimation
We investigate incremental word learning in a Hidden Markov Model (HMM) framework suitable for human-robot interaction. In interactive learning, the tutoring time is a crucial fac...
Irene Ayllón Clemente, Martin Heckmann, Ale...