Search Sciweavers | Sciweavers

176

Voted

ICASSP
2010
IEEE

152views Signal Processing» more ICASSP 2010»

Acceleration of sequence kernel computation for real-time speaker identification

15 years 7 months ago

The sequence kernel has been shown to be a promising kernel function for learning from sequential data such as speech and DNA. However, it is not scalable to massive datasets due ...

Makoto Yamada, Masashi Sugiyama, Gordon Wichern, T...

claim paper

Read More »

177

click to vote

ICASSP
2010
IEEE

131views Signal Processing» more ICASSP 2010»

Non-parallel training for many-to-many eigenvoice conversion

15 years 7 months ago

Download spalab.naist.jp

This paper presents a novel training method of an eigenvoice Gaussian mixture model (EV-GMM) effectively using non-parallel data sets for many-to-many eigenvoice conversion, which...

Yamato Ohtani, Tomoki Toda, Hiroshi Saruwatari, Ki...

claim paper

Read More »

195

click to vote

ICASSP
2010
IEEE

172views Signal Processing» more ICASSP 2010»

An improved consensus-like method for Minimum Bayes Risk decoding and lattice combination

15 years 7 months ago

Download research.microsoft.com

In this paper we describe a method for Minimum Bayes Risk decoding for speech recognition. This is a technique similar to Consensus a.k.a. Confusion Network Decoding, in which we ...

Haihua Xu, Daniel Povey, Lidia Mangu, Jie Zhu

claim paper

Read More »

181

click to vote

ICASSP
2010
IEEE

259views Signal Processing» more ICASSP 2010»

An adaptive level of detail approach to nonlinear estimation

15 years 7 months ago

Download www.lsv.uni-saarland.de

In this work, we present a general method for approximating nonlinear transformations of Gaussian mixture random variables. It is based on transforming the individual Gaussians wi...

Friedrich Faubel, Dietrich Klakow

claim paper

Read More »

183

click to vote

ICASSP
2010
IEEE

121views Signal Processing» more ICASSP 2010»

Voice activity detection using harmonic frequency components in likelihood ratio test

15 years 7 months ago

Download www.ee.ucla.edu

This paper proposes a new statistical model-based likelihood ratio test (LRT) VAD to obtain reliable speech / non-speech decisions. In the proposed method, the likelihood ratio (L...

Lee Ngee Tan, Bengt J. Borgstrom, Abeer Alwan

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers