Sciweavers

ICASSP
2010
IEEE
13 years 7 months ago
Automatic disfluency removal for improving spoken language translation
Statistical machine translation (SMT) systems for spoken languages suffer from conversational speech phenomena, in particular, the presence of speech dis uencies. We examine the i...
Wen Wang, Gökhan Tür, Jing Zheng, Necip ...
ICASSP
2010
IEEE
13 years 7 months ago
Hierarchical Gaussian Mixture Model
Gaussian mixture models (GMMs) are a convenient and essential tool for the estimation of probability density functions. Although GMMs are used in many research domains from image ...
Vincent Garcia, Frank Nielsen, Richard Nock
ICASSP
2010
IEEE
13 years 7 months ago
Automatic state discovery for unstructured audio scene classification
In this paper we present a novel scheme for unstructured audio scene classification that possesses three highly desirable and powerful features: autonomy, scalability, and robust...
Julian Ramos, Sajid M. Siddiqi, Artur Dubrawski, G...
ICASSP
2010
IEEE
13 years 7 months ago
Stochastic cross-layer resource allocation for wireless networks using orthogonal access: Optimality and delay analysis
Efficient design of wireless networks requires implementation of cross-layer algorithms that exploit channel state information. Capitalizing on convex optimization and stochastic...
Antonio G. Marqués, Georgios B. Giannakis, ...
ICASSP
2010
IEEE
13 years 7 months ago
Evaluation of Distance Based Amplitude panning for spatial audio
Distance-Based Amplitude Panning (DBAP) has recently been proposed as a new technique for panning sound sources in two and three dimensional spaces spaces. In this paper, DBAP is ...
Dimitar Kostadinov, Joshua D. Reiss, Valeri Mladen...
ICASSP
2010
IEEE
13 years 7 months ago
On the use of speaker superfactors for speaker recognition
We propose a new method to characterize a speaker within the Joint Factor Analysis (JFA) framework. Scoring within the JFA framework can be costly and a new method was proposed to...
Nicolas Scheffer, Robbie Vogt
ICASSP
2010
IEEE
13 years 7 months ago
An adaptive initialization method for speaker Diarization based on prosodic features
The following article presents a novel, adaptive initialization scheme that can be applied to most state-of-the-art Speaker Diarization algorithms, i.e. algorithms that use agglom...
David Imseng, Gerald Friedland
ICASSP
2010
IEEE
13 years 7 months ago
Noise-to-mask ratio minimization by weighted non-negative matrix factorization
This paper proposes a novel algorithm for minimizing the perceptual distortion in non-negative matrix factorization (NMF) based audio representation. We formulate the noise-to-mas...
Joonas Nikunen, Tuomas Virtanen
ICASSP
2010
IEEE
13 years 7 months ago
Efficient weighted-sum-rate maximization for a class of half-duplex cooperative systems
In many half-duplex cooperative systems, the direct formulation of the problem of finding the jointly optimal power and channel resource allocation that maximizes a weighted sum ...
Wessam Mesbah, Timothy N. Davidson
ICASSP
2010
IEEE
13 years 7 months ago
Synthesis of filled pauses based on a disfluent speech model
In the present paper we present a new approach to the synthesis of filled pauses. The problem is tackled from the point of view of disfluent speech synthesis. Based on the synth...
Jordi Adell, Antonio Bonafonte, David Escudero Man...