A simple and general method is described that can combine different knowledge sources to reorder N-best lists of hypotheses produced by a speech recognizer. The method is automati...
Manny Raynez, David Carter, Vassilios Digalakis, P...
The Linguistic Data Consortium (LDC) is currently involved in a major effort to expand its multilingual text resources, in particular for machine translation, message understandin...
The Workshop included an extended group of presentations by selected US government agencies and an invited guest from the European Community. These presentations, amplified in the...
Improved acoustic modeling can significantly decrease the error rate in large-vocabulary speech recognition. Our approach to the problem is twofold. We first propose a scheme that...
The Air Travel Information System (ATIS) domain serves as the common evaluation task for ARPA"spoken language system developers.1To support this task, the Multi-Site ATIS Dat...
Deborah A. Dahl, Madeleine Bates, Michael Brown, W...
This paper describes eight telephone-speech corpora at various stages of development at the Center for Spoken Language Understanding. For each corpus, we describe data collection ...
Ronald Cole, Mike Noel, Daniel C. Burnett, Mark A....
Most recent research in trainable part of speech taggers has explored stochastic tagging. While these taggers obtain high accuracy, linguistic information is captured indirectly, ...