Sciweavers

679 search results - page 108 / 136
» speech 2010
Sort
View
ICASSP
2010
IEEE
13 years 10 months ago
Acceleration of sequence kernel computation for real-time speaker identification
The sequence kernel has been shown to be a promising kernel function for learning from sequential data such as speech and DNA. However, it is not scalable to massive datasets due ...
Makoto Yamada, Masashi Sugiyama, Gordon Wichern, T...
ICASSP
2010
IEEE
13 years 10 months ago
Non-parallel training for many-to-many eigenvoice conversion
This paper presents a novel training method of an eigenvoice Gaussian mixture model (EV-GMM) effectively using non-parallel data sets for many-to-many eigenvoice conversion, which...
Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Ki...
ICASSP
2010
IEEE
13 years 10 months ago
An improved consensus-like method for Minimum Bayes Risk decoding and lattice combination
In this paper we describe a method for Minimum Bayes Risk decoding for speech recognition. This is a technique similar to Consensus a.k.a. Confusion Network Decoding, in which we ...
Haihua Xu, Daniel Povey, Lidia Mangu, Jie Zhu
ICASSP
2010
IEEE
13 years 10 months ago
An adaptive level of detail approach to nonlinear estimation
In this work, we present a general method for approximating nonlinear transformations of Gaussian mixture random variables. It is based on transforming the individual Gaussians wi...
Friedrich Faubel, Dietrich Klakow
ICASSP
2010
IEEE
13 years 10 months ago
Voice activity detection using harmonic frequency components in likelihood ratio test
This paper proposes a new statistical model-based likelihood ratio test (LRT) VAD to obtain reliable speech / non-speech decisions. In the proposed method, the likelihood ratio (L...
Lee Ngee Tan, Bengt J. Borgstrom, Abeer Alwan