Sciweavers

TASLP
2008
148views more  TASLP 2008»
13 years 11 months ago
Combining Spectral Representations for Large-Vocabulary Continuous Speech Recognition
In this paper we investigate the combination of complementary acoustic feature streams in large vocabulary continuous speech recognition (LVCSR). We have explored the use of acoust...
Giulia Garau, Steve Renals
TASLP
2008
92views more  TASLP 2008»
13 years 11 months ago
Hybrid Signal-and-Link-Parametric Speech Quality Measurement for VoIP Communications
A hybrid signal-and-link-parametric approach to speech quality measurement for voice-over-Internet protocol (VoIP) communications is described. Connection parameters are used to de...
Tiago H. Falk, Wai-Yip Chan
TASLP
2008
207views more  TASLP 2008»
13 years 11 months ago
Joint Morphological-Lexical Language Modeling for Processing Morphologically Rich Languages With Application to Dialectal Arabic
Language modeling for an inflected language such as Arabic poses new challenges for speech recognition and machine translation due to its rich morphology. Rich morphology results i...
Ruhi Sarikaya, Mohamed Afify, Yonggang Deng, Hakan...
TASLP
2008
148views more  TASLP 2008»
13 years 11 months ago
A Minimum Distortion Noise Reduction Algorithm With Multiple Microphones
Abstract--The problem of noise reduction using multiple microphones has long been an active area of research. Over the past few decades, most efforts have been devoted to beamformi...
Jingdong Chen, Jacob Benesty, Yiteng Huang
TASLP
2008
81views more  TASLP 2008»
13 years 11 months ago
Regularized Linear Prediction of Speech
L. A. Ekman, W. Bastiaan Kleijn, Manohar N. Murthi
TASLP
2008
143views more  TASLP 2008»
13 years 11 months ago
Strategies to Improve the Robustness of Agglomerative Hierarchical Clustering Under Data Source Variation for Speaker Diarizatio
Many current state-of-the-art speaker diarization systems exploit agglomerative hierarchical clustering (AHC) as their speaker clustering strategy, due to its simple processing str...
K. J. Han, S. Kim, S. S. Narayanan
TASLP
2008
90views more  TASLP 2008»
13 years 11 months ago
Efficient Realization of Wave Digital Components for Physical Modeling and Sound Synthesis
Wave digital filters (WDFs) were originally developed for robust discrete-time simulation of analog filters, but recently they have been applied successfully to modeling of physica...
Matti Karjalainen
TASLP
2008
124views more  TASLP 2008»
13 years 11 months ago
Semantic Annotation and Retrieval of Music and Sound Effects
We present a computer audition system that can both annotate novel audio tracks with semantically meaningful words and retrieve relevant tracks from a database of unlabeled audio c...
Douglas Turnbull, Luke Barrington, D. Torres, Gert...
TASLP
2008
88views more  TASLP 2008»
13 years 11 months ago
Transforming Perceived Vocal Effort and Breathiness Using Adaptive Pre-Emphasis Linear Prediction
Abstract--This paper presents a technique to transform high-effort voices into breathy voices using adaptive pre-emphasis linear prediction (APLP). The primary benefit of this tech...
K. I. Nordstrom, George Tzanetakis, Peter F. Dries...
TASLP
2008
64views more  TASLP 2008»
13 years 11 months ago
Improved A Posteriori Speech Presence Probability Estimation Based on a Likelihood Ratio With Fixed Priors
Abstract--In this contribution we present an improved estimator for the speech presence probability at each time-frequency point in the short-time Fourier-transform domain. In cont...
Timo Gerkmann, Colin Breithaupt, Rainer Martin