In this paper we propose discriminative training of hierarchical acoustic models for large vocabulary continuous speech recognition tasks. After presenting our hierarchical modeling framework, we describe how the models can be generated with either Minimum Classification Error or large-margin training. Experiments on a large vocabulary lecture transcription task show that the hierarchical model can yield more than
Hung-An Chang, James R. Glass