Sciweavers

» The COST-277 Speech Database
CSL
1999
Springer
13 years 8 months ago
A hidden Markov-model-based trainable speech synthesizer
This paper presents a new approach to speech synthesis in which a set of cross-word decision-tree state-clustered context-dependent hidden Markov models are used to define a set o...
R. E. Donovan, Philip C. Woodland
ICMCS
2009
IEEE
13 years 6 months ago
Speech control in surgery: A field analysis and strategies
This work introduces a robot driven camera controlled by speech. The SIMIS database of 20 recordings of real life surgical operations serves as basis for analyses and noise modell...
Björn Schuller, Salman Can, Hubertus Feussner...
ICASSP
2011
IEEE
13 years 11 days ago
A multi-stream ASR framework for BLSTM modeling of conversational speech
We propose a novel multi-stream framework for continuous conversational speech recognition which employs bidirectional Long Short-Term Memory (BLSTM) networks for phoneme predicti...
Martin Wöllmer, Florian Eyben, Björn Sch...
ICASSP
2011
IEEE
13 years 11 days ago
Non-negative matrix deconvolution in noise robust speech recognition
High noise robustness has been achieved in speech recognition by using sparse exemplar-based methods with spectrogram windows spanning up to 300 ms. A downside is that a large exe...
Antti Hurmalainen, Jort F. Gemmeke, Tuomas Virtane...
TASLP
2002
13 years 8 months ago
Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech informatio
With the rapidly growing use of the audio and multimedia information over the Internet, the technology for retrieving speech information using voice queries is becoming more and mo...
Berlin Chen, Hsin-Min Wang, Lin-Shan Lee