A criterion for the enhancement of time-frequency masks in missing data recognition


Download www.ee.uwa.edu.au

Despite their effectiveness for robust speech processing, missing data techniques are vulnerable to errors in the classification of the input speech signal’s time-frequency points. A direct method for removing these mask errors is top-down optimization of the estimated mask; however, this requires a measure that evaluates mask quality without a priori knowledge of the noise. In this paper we propose the normalized likelihood confidence as such a criterion for robust speaker recognition. In this approach, the accuracy with which an estimated mask classifies time-frequency points as corrupt or reliable is related to its likelihood score confidence. This is based on the conceptual effect of binary mask errors on the model likelihood distributions produced by accumulated marginalization densities. Experimental results confirm a relationship between the normalized likelihood distance and the accuracy of the time-frequency mask produced by various estimation strategies.
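To make the role of the binary mask concrete, the following is a minimal sketch of marginalization-based scoring. It assumes diagonal-covariance Gaussian mixture speaker models and plain (unbounded) marginalization, in which unreliable points simply drop out of the likelihood. The function names and the model layout are illustrative assumptions, not taken from the paper. The abstract does not define the normalized likelihood confidence, so it is not reproduced here.

```python
import math


def log_gauss(x, mean, var):
    """Log density of a one-dimensional Gaussian."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def masked_frame_loglik(frame, mask, weights, means, variances):
    """Log-likelihood of one spectral frame under a diagonal GMM.

    Only channels with mask value 1 (reliable) contribute. Channels with
    mask value 0 are marginalized: a Gaussian integrated over its whole
    range equals 1, so those channels add nothing to the log-likelihood.
    """
    per_mixture = []
    for w, mu, var in zip(weights, means, variances):
        ll = math.log(w)
        for x, m, mu_d, var_d in zip(frame, mask, mu, var):
            if m:
                ll += log_gauss(x, mu_d, var_d)
        per_mixture.append(ll)
    # Log-sum-exp over mixture components for numerical stability.
    top = max(per_mixture)
    return top + math.log(sum(math.exp(v - top) for v in per_mixture))


def utterance_loglik(frames, masks, model):
    """Accumulate frame log-likelihoods over an utterance.

    `model` is a hypothetical (weights, means, variances) tuple.
    """
    return sum(masked_frame_loglik(f, m, *model) for f, m in zip(frames, masks))
```

A mask error changes which channels enter this sum: treating a corrupt point as reliable pulls a noise-dominated value into every speaker model's score, while treating a reliable point as corrupt discards usable evidence.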

Daniel Pullella, Roberto Togneri


ICASSP 2009 | Mask | Mask Errors | Normalized Likelihood | Signal Processing


Added	21 May 2010
Updated	21 May 2010
Type	Conference
Year	2009
Where	ICASSP
Authors	Daniel Pullella, Roberto Togneri
