Conditional Random Fields (CRFs) are a state-of-the-art approach to natural language processing tasks like grapheme-to-phoneme (g2p) conversion, which is used to produce pronunciati...
Patrick Lehnen, Stefan Hahn, Andreas Guta, Hermann...
The use of speaker adaptation transforms as features for speaker recognition is an appealing alternative to conventional short-term cepstral features. In general, this kind of met...
Sound textures may be defined as sounds whose character depends on statistical properties as much as the specific details of each individually-perceived event. Recent work has d...
Daniel P. W. Ellis, Xiaohong Zeng, Josh H. McDermo...
Endmember extraction is of prime importance in the process of hyperspectral unmixing so as to study the mineral composition of a landscape from its hyperspectral observations. Tho...
The performance of a multiuser MIMO broadcast system depends highly on how the users being served are selected from the pool of users requesting service. Though dirty paper coding...
We present a novel approach to represent transients using spectral-domain amplitude-modulated/frequency-modulated (AM-FM) functions. The model is applied to the real and imaginary...
In this paper, we extend the work done on integrating multilayer perceptron (MLP) networks with HMM systems via the Tandem approach. In particular, we explore whether the use of D...
This paper investigates the use of phoneme class conditional probabilities as features (posterior features) for template-based ASR. Using 75 words and 600 words task-independent a...
Serena Soldo, Mathew Magimai-Doss, Joel Pinto, Her...
Finite-Difference Time-Domain (FDTD) acoustic simulation was used to calculate Pinna-Related Transfer Functions (PRTFs) of the KEMAR manikin's DB60 pinna. A baseline set of 2...
This paper presents a sound source (talker) localization method using only a single microphone. In our previous work [1], we discussed the single-channel sound source localization...