A One Pass Decoder Design For Large Vocabulary Recognition

To achieve reasonable accuracy in large vocabulary speech recognition systems, it is important to use detailed acoustic models together with good long-span language models. For example, in the Wall Street Journal (WSJ) task, both cross-word triphones and a trigram language model are necessary to achieve state-of-the-art performance. However, when using these models, the size of a pre-compiled recognition network can make a standard Viterbi search infeasible, and hence either multiple-pass or asynchronous stack decoding schemes are typically used. In this paper, we show that time-synchronous one-pass decoding using cross-word triphones and a trigram language model can be implemented using a dynamically built tree-structured network. This approach avoids the compromises inherent in using fast-matches or preliminary passes and is relatively efficient in implementation. It was included in the HTK large vocabulary speech recognition system used for the 1993 ARPA WSJ evaluation and experimen...
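As a rough illustration of why a tree-structured network helps, the sketch below builds a phone prefix tree from a toy pronunciation lexicon and compares its size with a linear lexicon that stores one phone chain per word. It is not the decoder described in the paper: it omits cross-word triphones, the trigram language model, and dynamic network construction, and the names `Node`, `build_tree`, `count_nodes`, and the pronunciations in `LEXICON` are assumptions made only for this example.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One node of a phone prefix tree (hypothetical structure for illustration)."""
    children: dict = field(default_factory=dict)  # phone label -> child Node
    words: list = field(default_factory=list)     # words whose pronunciation ends here


def build_tree(lexicon):
    """Insert every pronunciation so that words sharing a phone prefix share nodes."""
    root = Node()
    for word, phones in lexicon.items():
        node = root
        for phone in phones:
            node = node.children.setdefault(phone, Node())
        node.words.append(word)
    return root


def count_nodes(node):
    """Count the phone nodes below `node` (the root itself is not counted)."""
    return sum(1 + count_nodes(child) for child in node.children.values())


# Toy lexicon; the phone strings are illustrative, not taken from any real dictionary.
LEXICON = {
    "speech": ["s", "p", "iy", "ch"],
    "speed": ["s", "p", "iy", "d"],
    "spell": ["s", "p", "eh", "l"],
    "tree": ["t", "r", "iy"],
}

tree = build_tree(LEXICON)
linear_size = sum(len(phones) for phones in LEXICON.values())  # one chain per word
tree_size = count_nodes(tree)                                   # shared prefixes merged

print(f"linear lexicon: {linear_size} phone nodes, tree lexicon: {tree_size} phone nodes")
```

In this toy case the three words beginning with "s p" share their initial nodes, so the tree needs fewer phone models than the linear layout. The word identity is only known at the node where a pronunciation ends, which is one reason applying a language model in a tree-structured search takes extra care.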

J. J. Odell, V. Valtchev, Philip C. Woodland, S. J. Young

Language Model | NAACL 1994 | NAACL 2007 | Trigram Language Model | Vocabulary Speech Recognition |

» Large vocabulary sign language recognition based on hierarchical decision trees

» Towards automated large vocabulary gesture search

» A Comparison Between Etymon- and Word-Based Chinese Sign Language Recognition Systems

» Variational approximation of long-span language models for LVCSR

» Flexible Low Power Probability Density Estimation Unit For Speech Recognition

» Multilingual Speech Databases at LDC

» Feature versus model based noise robustness

» Enhancing a Sign Language Translation System with Vision-Based Features

Post Info

Added	02 Nov 2010
Updated	02 Nov 2010
Type	Conference
Year	1994
Where	NAACL
Authors	J. J. Odell, V. Valtchev, Philip C. Woodland, S. J. Young
