Sciweavers

ICASSP
2011
IEEE
12 years 11 months ago
Incorporating alignments into Conditional Random Fields for grapheme to phoneme conversion
Conditional Random Fields (CRFs) are a state-of-the-art approach to natural language processing tasks like grapheme-to-phoneme (g2p) conversion which is used to produce pronunciati...
Patrick Lehnen, Stefan Hahn, Andreas Guta, Hermann...
ICASSP
2011
IEEE
12 years 11 months ago
Parallel Transformation Network features for speaker recognition
The use of speaker adaptation transforms as features for speaker recognition is an appealing alternative to conventional short-term cepstral features. In general, this kind of met...
Alberto Abad, Jordi Luque, Isabel Trancoso
ICASSP
2011
IEEE
12 years 11 months ago
Classifying soundtracks with audio texture features
Sound textures may be defined as sounds whose character depends on statistical properties as much as the specific details of each individually-perceived event. Recent work has d...
Daniel P. W. Ellis, Xiaohong Zeng, Josh H. McDermo...
ICASSP
2011
IEEE
12 years 11 months ago
Two effective and computationally efficient pure-pixel based algorithms for hyperspectral endmember extraction
Endmember extraction is of prime importance in the process of hyperspectral unmixing so as to study the mineral composition of a landscape from its hyperspectral observations. Tho...
Arul-Murugan Ambikapathi, Tsung-Han Chan, Chong-Yu...
ICASSP
2011
IEEE
12 years 11 months ago
User selection schemes for maximizing throughput of multiuser MIMO systems using Zero Forcing Beamforming
The performance of a multiuser MIMO broadcast system depends highly on how the users being served are selected from the pool of users requesting service. Though dirty paper coding...
Anh H. Nguyen, Bhaskar D. Rao
ICASSP
2011
IEEE
12 years 11 months ago
Spectral-envelope and group-delay models for transient signals - Applications to castanets and stop consonants
We present a novel approach to represent transients using spectral-domain amplitude-modulated/frequency-modulated (AM-FM) functions. The model is applied to the real and imaginary...
Ravi R. Shenoy, Chandra Sekhar Seelamantula
ICASSP
2011
IEEE
12 years 11 months ago
Comparing multilayer perceptron to Deep Belief Network Tandem features for robust ASR
In this paper, we extend the work done on integrating multilayer perceptron (MLP) networks with HMM systems via the Tandem approach. In particular, we explore whether the use of D...
Oriol Vinyals, Suman V. Ravuri
ICASSP
2011
IEEE
12 years 11 months ago
Posterior features for template-based ASR
This paper investigates the use of phoneme class conditional probabilities as features (posterior features) for template-based ASR. Using 75 words and 600 words task-independent a...
Serena Soldo, Mathew Magimai-Doss, Joel Pinto, Her...
ICASSP
2011
IEEE
12 years 11 months ago
Pinna sensitivity patterns reveal reflecting and diffracting surfaces that generate the first spectral notch in the front median
Finite-Difference Time Domain (FDTD) acoustic simulation was used to calculate Pinna-Related Transfer Functions (PRTFs) of the KEMAR manikin's DB60 pinna. A baseline set of 2...
Parham Mokhtari, Hironori Takemoto, Ryouichi Nishi...
ICASSP
2011
IEEE
12 years 11 months ago
Feature selection based on Multiple Kernel Learning for single-channel sound source localization using the acoustic transfer fun
This paper presents a sound source (talker) localization method using only a single microphone. In our previous work [1], we discussed the single-channel sound source localization...
Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki