Sciweavers

CSL
2007
Springer
13 years 11 months ago
Stochastic and syntactic techniques for predicting phrase breaks
Determining the position of breaks in a sentence is a key task for a text-to-speech (TTS) system. We describe some methods for phrase break prediction in which the whole sentence ...
Ian Read, Stephen Cox
CSL
2007
Springer
13 years 11 months ago
Accessing speech data using strategic fixation
When users access information from text, they engage in strategic fixation, visually scanning the text to focus on regions of interest. However, because speech is both serial and ...
Steve Whittaker, Julia Hirschberg
CSL
2007
Springer
13 years 11 months ago
Articulatory feature recognition using dynamic Bayesian networks
This paper describes the use of dynamic Bayesian networks for the task of articulatory feature recognition. We show that by modeling the dependencies between a set of 6 multi-leve...
Joe Frankel, Mirjam Wester, Simon King
CSL
2007
Springer
13 years 11 months ago
Ginisupport vector machines for segmental minimum Bayes risk decoding of continuous speech
Veera Venkataramani, Shantanu Chakrabartty, Willia...
CSL
2007
Springer
13 years 11 months ago
On noise masking for automatic missing data speech recognition: A survey and discussion
Automatic speech recognition (ASR) has reached very high levels of performance in controlled situations. However, the performance degrades significantly when environmental noise ...
Christophe Cerisara, Sébastien Demange, Jea...
CSL
2007
Springer
13 years 11 months ago
Modeling durations of syllables using neural networks
In this paper, we propose a neural network model for predicting the durations of syllables. A four layer feedforward neural network trained with backpropagation algorithm is used ...
K. Sreenivasa Rao, B. Yegnanarayana
CSL
2007
Springer
13 years 11 months ago
Soft indexing of speech content for search in spoken documents
The paper presents the Position Specific Posterior Lattice (PSPL), a novel lossy representation of automatic speech recognition lattices that naturally lends itself to efficient ...
Ciprian Chelba, Jorge Silva, Alex Acero
CSL
2007
Springer
13 years 11 months ago
Synthesized speech intelligibility and persuasion: Speech rate and non-native listeners
This experiment assessed the effect of variation in speech rate on comprehension and persuasiveness of a message presented in text-to-speech (TTS) synthesis to native and non-nat...
Caroline Jones, Lynn Berry, Catherine Stevens
CSL
2007
Springer
13 years 11 months ago
Partially observable Markov decision processes for spoken dialog systems
In a spoken dialog system, determining which action a machine should take in a given situation is a difficult problem because automatic speech recognition is unreliable and hence ...
Jason D. Williams, Steve Young
CSL
2007
Springer
13 years 11 months ago
Automatic phonetic transcription of large speech corpora
This study is aimed at investigating whether automatic phonetic transcription procedures can approximate manual transcriptions typically delivered with contemporary large speech c...
Christophe Van Bael, Lou Boves, Henk van den Heuve...