Sciweavers

ECML
2003
Springer

A Hybrid Language Model based on Stochastic Context-free Grammars

14 years 4 months ago
A Hybrid Language Model based on Stochastic Context-free Grammars
Abstract. This paper explores the use of initial Stochastic Context-Free Grammars (SCFG) obtained from a treebank corpus for the learning of SCFG by means of estimation algorithms. A hybrid language model is defined as a combination of a word-based n-gram, which is used to capture the local relations between words, and a category-based SCFG with a word distribution into categories, which is defined to represent the long-term relations between these categories. Experiments on the UPenn Treebank corpus are reported. These experiments have been carried out in terms of the test set perplexity and the word error rate in a speech recognition experiment.
Diego Linares, José-Miguel Benedí, J
Added 06 Jul 2010
Updated 06 Jul 2010
Type Conference
Year 2003
Where ECML
Authors Diego Linares, José-Miguel Benedí, Joan-Andreu Sánchez
Comments (0)