Compensation of extrinsic variability in speaker verification systems on simulated Skype and HF channel data

13 years 6 months ago

Download www5.informatik.uni-erlangen.de

In this work we focus on speaker veriﬁcation on channels of varying quality, namely Skype and high frequency (HF) radio. In our setup, we assume to have telephone recordings of speakers for training, but recordings of different channels for testing with varying (lower) signal quality. Starting from a Gaussian mixture / support vector machine (GMM/SVM) baseline, we evaluate multi-condition training (MCT), an ideal channel classiﬁcation approach (ICC), and nuisance attribute projection (NAP) to compensate for the loss of information due to the transmission. In an evaluation on Switchboard-2 data using Skype and HF channel simulators, we show that, for good signal quality, NAP improves the baseline system performance from 5% EER to 3.33% EER (for both Skype and HF). For strongly distorted data, MCT or, if adequate, ICC turn out to be the method of choice.

Korbinian Riedhammer, Tobias Bocklet, Elmar Nö

Real-time Traffic

HF Channel Simulators | ICASSP 2011 | Signal Processing | Signal Quality | Skype |

claim paper

Post Info
More Details (n/a)

Added	20 Aug 2011
Updated	20 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Korbinian Riedhammer, Tobias Bocklet, Elmar Nöth

Comments (0)

Sciweavers

Compensation of extrinsic variability in speaker verification systems on simulated Skype and HF channel data

HF Channel Simulators | ICASSP 2011 | Signal Processing | Signal Quality | Skype |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers