Capturing Local Variability for Speaker Normalization in Speech Recognition

14 years 22 days ago

Download lorien.die.upm.es

The new model reduces the impact of local spectral and temporal variability by estimating a finite set of spectral and temporal warping factors which are applied to speech at the frame level. Optimum warping factors are obtained while decoding in a locally constrained search. The model involves augmenting the states of a standard hidden Markov model (HMM), providing an additional degree of freedom. It is argued in this paper that this represents an efficient and effective method for compensating local variability in speech which may have potential application to a broader array of speech transformations. The technique is presented in the context of existing methods for frequency warpingbased speaker normalization for ASR. The new model is evaluated in clean and noisy task domains using subsets of the Aurora 2, the Spanish Speech-Dat-Car, and the TIDIGITS corpora. In addition, some experiments are performed on a Spanish language corpus collected from a population of speakers with a rang...

Antonio Miguel, Eduardo Lleida, Richard Rose, Luis

Real-time Traffic

Hidden Markov Model | TASLP 2008 | Temporal Variability | Temporal Warping Factors |

claim paper

Post Info
More Details (n/a)

Added	15 Dec 2010
Updated	15 Dec 2010
Type	Journal
Year	2008
Where	TASLP
Authors	Antonio Miguel, Eduardo Lleida, Richard Rose, Luis Buera, Oscar Saz, Alfonso Ortega

Comments (0)

Sciweavers

Capturing Local Variability for Speaker Normalization in Speech Recognition

Hidden Markov Model | TASLP 2008 | Temporal Variability | Temporal Warping Factors |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers