Sciweavers

CSL
2007
Springer
14 years 12 days ago
Speaker-adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation
A novel speaker-adaptive learning algorithm is developed and evaluated for a hidden trajectory model of speech coarticulation and reduction. Central to this model is the process o...
Dong Yu, Li Deng, Alex Acero
CSL
2007
Springer
14 years 12 days ago
Discriminative n-gram language modeling
This paper describes discriminative language modeling for a large vocabulary speech recognition task. We contrast two parameter estimation methods: the perceptron algorithm, and a...
Brian Roark, Murat Saraclar, Michael Collins
CSL
2007
Springer
14 years 12 days ago
Stochastic and syntactic techniques for predicting phrase breaks
Determining the position of breaks in a sentence is a key task for a text-to-speech (TTS) system. We describe some methods for phrase break prediction in which the whole sentence ...
Ian Read, Stephen Cox
CSL
2007
Springer
14 years 12 days ago
Accessing speech data using strategic fixation
When users access information from text, they engage in strategic fixation, visually scanning the text to focus on regions of interest. However, because speech is both serial and ...
Steve Whittaker, Julia Hirschberg
CSL
2007
Springer
14 years 12 days ago
Articulatory feature recognition using dynamic Bayesian networks
This paper describes the use of dynamic Bayesian networks for the task of articulatory feature recognition. We show that by modeling the dependencies between a set of 6 multi-leve...
Joe Frankel, Mirjam Wester, Simon King
CSL
2007
Springer
14 years 12 days ago
On noise masking for automatic missing data speech recognition: A survey and discussion
Automatic speech recognition (ASR) has reached very high levels of performance in controlled situations. However, the performance degrades significantly when environmental noise ...
Christophe Cerisara, Sébastien Demange, Jea...
CSL
2007
Springer
14 years 12 days ago
Modeling durations of syllables using neural networks
In this paper, we propose a neural network model for predicting the durations of syllables. A four layer feedforward neural network trained with backpropagation algorithm is used ...
K. Sreenivasa Rao, B. Yegnanarayana
CSL
2007
Springer
14 years 12 days ago
Soft indexing of speech content for search in spoken documents
The paper presents the Position Specific Posterior Lattice (PSPL), a novel lossy representation of automatic speech recognition lattices that naturally lends itself to efficient ...
Ciprian Chelba, Jorge Silva, Alex Acero
CSL
2007
Springer
14 years 12 days ago
Synthesized speech intelligibility and persuasion: Speech rate and non-native listeners
This experiment assessed the effect of variation in speech rate on comprehension and persuasiveness of a message presented in text-to-speech (TTS) synthesis to native and non-nat...
Caroline Jones, Lynn Berry, Catherine Stevens