Sciweavers

INTERSPEECH
2010
13 years 6 months ago
Multichannel noise reduction using low order RTF estimate
The relative transfer function generalized sidelobe canceler (RTF-GSC) is a popular method for implementing multichannel speech enhancement. However, an accurate estimation of cha...
Subhojit Chakladar, Nam Soo Kim, Yu Gwang Jin, Tae...
INTERSPEECH
2010
13 years 6 months ago
Learning a language model from continuous speech
This paper presents a new approach to language model construction, learning a language model not from text, but directly from continuous speech. A phoneme lattice is created using...
Graham Neubig, Masato Mimura, Shinsuke Mori, Tatsu...
INTERSPEECH
2010
13 years 6 months ago
Automatic speech recognition system channel modeling
In this paper, we present a systems approach for channel modeling of an Automatic Speech Recognition (ASR) system. This can have implications in improving speech recognition compo...
Qun Feng Tan, Kartik Audhkhasi, Panayiotis G. Geor...
INTERSPEECH
2010
13 years 6 months ago
Enhancements of viterbi search for fast unit selection synthesis
The paper describes the optimisation of Viterbi search used in unit selection TTS, since with a large speech corpus necessary to achieve a high level of naturalness, the performan...
Daniel Tihelka, Jirí Kala, Jindrich Matouse...
INTERSPEECH
2010
13 years 6 months ago
Automatic estimation of transcription accuracy and difficulty
Managing a large-scale speech transcription task with a team of human transcribers requires effective quality control and workload distribution. As it becomes easier and cheaper t...
Brandon Roy, Soroush Vosoughi, Deb Roy
INTERSPEECH
2010
13 years 6 months ago
Nucleus position within the intonation phrase: a typological study of English, Czech and Hungarian
In this paper we examine cases of non-final nucleus (or sentence stress) in English, Czech and Hungarian. These three languages differ substantially with respect to word order rul...
Tomás Dubeda, Katalin Mády
INTERSPEECH
2010
13 years 6 months ago
Articulatory synthesis and perception of plosive-vowel syllables with virtual consonant targets
Virtual articulatory targets are a concept to explain the different trajectories of primary and secondary articulators during consonant production, as well as the different places...
Peter Birkholz, Bernd J. Kröger, Christiane N...
INTERSPEECH
2010
13 years 6 months ago
Memory-based active learning for French broadcast news
Stochastic dependency parsers can achieve very good results when they are trained on large corpora that have been manually annotated. Active learning is a procedure that aims at r...
Frédéric Tantini, Christophe Cerisar...
INTERSPEECH
2010
13 years 6 months ago
Voice search for development
In light of the serious problems with both illiteracy and information access in the developing world, there is a widespread belief that speech technology can play a significant ro...
Etienne Barnard, Johan Schalkwyk, Charl Johannes v...
INTERSPEECH
2010
13 years 6 months ago
Online adaptive learning for speech recognition decoding
We describe a new method for pruning in dynamic models based on running an adaptive filtering algorithm online during decoding to predict aspects of the scores in the near future....
Jeff Bilmes, Hui Lin