Sciweavers

ICASSP
2011
IEEE
Accurate parameter generation using fixed-point arithmetic for embedded HMM-based speech synthesizers
Parameter trajectory generation for HMM-based speech synthesis is practically achieved using only fixed-point arithmetic with 32-bit integers. Since processors for embedded devic...
Nobuyuki Nishizawa, Tsuneo Kato
ICASSP
2011
IEEE
Compressed learning of high-dimensional sparse functions
This paper presents a simple randomised algorithm for recovering high-dimensional sparse functions, i.e. functions f : [0, 1]^d → R which depend effectively only on k out of d va...
Karin Schnass, Jan Vybíral
ICASSP
2011
IEEE
Bird species recognition combining acoustic and sequence modeling
The goal of this work was to explore modeling techniques to improve bird species classification from audio samples. We first developed an unsupervised approach to obtain approxima...
Martin Graciarena, Michelle Delplanche, Elizabeth ...
ICASSP
2011
IEEE
Integrating articulatory features using Kullback-Leibler divergence based acoustic model for phoneme recognition
In this paper, we propose a novel framework to integrate articulatory features (AFs) into an HMM-based ASR system. This is achieved by using posterior probabilities of different AFs...
Ramya Rasipuram, Magimai.-Doss Mathew
ICASSP
2011
IEEE
Bayesian reinforcement learning for POMDP-based dialogue systems
Spoken dialogue systems are gaining popularity with improvements in speech recognition technologies. Dialogue systems can be modeled effectively using POMDPs, achieving improvemen...
ShaoWei Png, Joelle Pineau
ICASSP
2011
IEEE
Nonlinear properties of snoring sounds
In this paper, the Gaussianity and linearity of the snoring sound (SS) segments extracted from respiratory sounds are discussed. The respiratory sound signals were recorded from 3...
Ali Azarbarzin, Zahra Moussavi
ICASSP
2011
IEEE
Using stacked transformations for recognizing foreign accented speech
A common problem in speech recognition for foreign accented speech is that there is not enough training data for an accent-specific or a speaker-specific recognizer. Speaker ada...
Peter Smit, Mikko Kurimo
ICASSP
2011
IEEE
A learning-based approach to explosives detection using Multi-Energy X-Ray Computed Tomography
In this paper we consider the task of classifying materials into explosives and non-explosives according to features obtainable from Multi-Energy X-ray Computed Tomography (MECT) ...
Limor Eger, Synho Do, Prakash Ishwar, W. Clem Karl...
ICASSP
2011
IEEE
Informative dialect recognition using context-dependent pronunciation modeling
We propose an informative dialect recognition system that learns phonetic transformation rules, and uses them to identify dialects. A hidden Markov model is used to align referenc...
Nancy F. Chen, Wade Shen, Joseph P. Campbell, Pedr...
ICASSP
2011
IEEE
Robust speaker turn role labeling of TV Broadcast News shows
Speaker role recognition in TV Broadcast News shows is addressed in this paper with a particular focus on speaker turn role labeling. A mixed approach combining speaker clustering...
Géraldine Damnati, Delphine Charlet