Sciweavers

ICASSP
2010
IEEE
13 years 7 months ago
Enhanced trellis based vector quantization for coordinated beamforming
This paper presents two techniques to boost the quantization performance of trellis-based beamforming vector quantization schemes. It is well known that tail-biting trelli...
Chun Kin Au-Yeung, Shahab Sanayei
ICASSP
2010
IEEE
13 years 7 months ago
Rapid integration of Parts of Speech information to improve reordering model for English-Farsi Speech to Speech Translation
Integrating Parts of Speech (POS) information into a Machine Translation (MT) model usually requires significant changes to the MT decoder. We present a method to rapidly integrate...
Sameer Maskey, Bowen Zhou
ICASSP
2010
IEEE
13 years 7 months ago
Approximate nearest neighbors using sparse representations
A new method is introduced that makes use of sparse image representations to search for approximate nearest neighbors (ANN) under the normalized inner-product distance. The approa...
Joaquin Zepeda, Ewa Kijak, Christine Guillemot
ICASSP
2010
IEEE
13 years 7 months ago
Optimize the obvious: Automatic call flow generation
In commercial spoken dialog systems, call flows are built by call flow designers implementing a predefined business logic. While it may appear obvious from this logic how the c...
David Suendermann, Jackson Liscombe, Roberto Piera...
ICASSP
2010
IEEE
13 years 7 months ago
Asymptotic analysis of the Huberized LASSO estimator
Xiaohui Chen, Z. Jane Wang, Martin J. McKeown
ICASSP
2010
IEEE
13 years 7 months ago
On linear versus non-linear magnitude-DFT estimators and the influence of super-Gaussian speech priors
Although the linear mean-squared error (MSE) complex-DFT estimator, i.e., the Wiener filter, is well-known, its magnitude-DFT (MDFT) counterpart has never been considered in the ...
Richard C. Hendriks, Richard Heusdens
ICASSP
2010
IEEE
13 years 7 months ago
Learning-based auditory encoding for robust speech recognition
Yu-Hsiang Bosco Chiu, Bhiksha Raj, Richard M. Ster...
ICASSP
2010
IEEE
13 years 7 months ago
A new penalty term for the BIC with respect to speaker diarization
In this paper we revise the penalty term of the Bayesian Information Criterion (BIC). Based on our previous approach to penalize each cluster only with its corresponding effective...
Themos Stafylakis, Georgios Tzimiropoulos, Vassili...
ICASSP
2010
IEEE
13 years 7 months ago
Learning deep rhetorical structure for extractive speech summarization
Extractive summarization of conference and lecture speech is useful for online learning and references. We show for the first time that deep(er) rhetorical parsing of conference ...
Justin Jian Zhang, Pascale Fung
ICASSP
2010
IEEE
13 years 7 months ago
Statistical approach to enhancing esophageal speech based on Gaussian mixture models
This paper presents a novel method of enhancing esophageal speech using statistical voice conversion. Esophageal speech is one of the alternative speaking methods for laryngectome...
Hironori Doi, Keigo Nakamura, Tomoki Toda, Hiroshi...