The Gaussian mixture model (GMM) can approximate arbitrary probability distributions, which makes it a powerful tool for feature representation and classification. However, its performance degrades when training data are insufficient, especially when the feature space is of high dimensionality. In this paper, we present a novel approach to boosting GMMs via discriminant analysis, in which the required amount of training data depends only upon the number of classes, regardless of the feature dimension. We demonstrate the effectiveness of the proposed BoostGMM-DA classifier by applying it to the problem of emotion recognition in speech. Our experimental results indicate that the BoostGMM-DA classifier achieves significantly higher recognition rates than the conventional GMM minimum error rate (MER) classifier under the same training conditions, and that it requires significantly less training data to yield recognition rates comparable to those of the GMM-MER classifier.
Hao Tang, Thomas S. Huang
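To make the abstract's key point concrete, the following is a minimal sketch of one generic way to combine per-class GMM log-likelihood scores with linear discriminant analysis, so that the derived score space has dimensionality equal to the number of classes rather than the original feature dimension. It assumes scikit-learn and uses hypothetical names and stand-in data; it is an illustration of the general GMM-plus-discriminant-analysis idea, not the paper's BoostGMM-DA algorithm.

```python
# Illustrative sketch only: a generic GMM + discriminant-analysis pipeline,
# NOT the BoostGMM-DA algorithm of this paper. Assumes scikit-learn.
import numpy as np
from sklearn.mixture import GaussianMixture
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

def fit_class_gmms(X, y, n_components=4, seed=0):
    """Fit one GMM per class on that class's training features."""
    gmms = {}
    for c in np.unique(y):
        gmm = GaussianMixture(n_components=n_components,
                              covariance_type="diag", random_state=seed)
        gmm.fit(X[y == c])
        gmms[c] = gmm
    return gmms

def gmm_score_features(gmms, X):
    """Map each sample to its vector of per-class log-likelihoods.

    The resulting dimensionality equals the number of classes,
    independent of the original feature dimension.
    """
    return np.column_stack([gmms[c].score_samples(X) for c in sorted(gmms)])

# Usage with random stand-in data (not real speech features).
rng = np.random.default_rng(0)
X_train = rng.normal(size=(200, 39))    # e.g. 39-dim MFCC-like vectors
y_train = rng.integers(0, 4, size=200)  # 4 hypothetical emotion classes
gmms = fit_class_gmms(X_train, y_train)
lda = LinearDiscriminantAnalysis().fit(gmm_score_features(gmms, X_train), y_train)
y_pred = lda.predict(gmm_score_features(gmms, X_train))
```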