Robert C. Moore, Chris Quirk

The recent availability of large corpora for training N-gram language models has demonstrated the utility of models of higher order than trigrams. In this paper, we investigate methods to control the increase in model size that results from applying standard methods at higher orders. We introduce significance-based N-gram selection, which not only reduces model size but also improves perplexity for several smoothing methods, including Katz backoff and absolute discounting. We also show that, when combined with a new smoothing method and a novel variant of weighted-difference pruning, our selection method achieves a better trade-off between model size and perplexity than the best pruning method we found for modified Kneser-Ney smoothing.
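As background for the pruning baseline the abstract refers to, the following minimal Python sketch illustrates the standard weighted-difference pruning criterion (an N-gram's count times the log-probability gain of its explicit estimate over the backed-off estimate). It is only an illustration of the conventional criterion, not the paper's novel variant or selection method, and all names (weighted_difference, prune, the record fields) are hypothetical.

```python
import math

def weighted_difference(count, logp_full, logp_backoff):
    """Standard weighted-difference score for an N-gram:
    training count times the log-probability gain of the explicit
    estimate over the backed-off estimate. Low scores mark pruning candidates."""
    return count * (logp_full - logp_backoff)

def prune(ngrams, threshold):
    """Keep only N-grams whose weighted-difference score meets the threshold.

    `ngrams` is assumed to map an N-gram tuple to a record holding its count,
    its smoothed log probability, and the log probability assigned by the
    lower-order (backoff) distribution -- all hypothetical field names.
    """
    kept = {}
    for ngram, rec in ngrams.items():
        score = weighted_difference(rec["count"], rec["logp"], rec["backoff_logp"])
        if score >= threshold:
            kept[ngram] = rec
    return kept

# Toy usage: a 4-gram whose explicit estimate barely improves on its backoff
example = {
    ("the", "cat", "sat", "on"): {
        "count": 3,
        "logp": math.log(0.021),
        "backoff_logp": math.log(0.019),
    },
}
print(prune(example, threshold=0.5))  # pruned: small count, small log-prob gain
```

Under this kind of criterion, shrinking the model means discarding explicitly stored higher-order N-grams, which is exactly the size/perplexity trade-off the abstract says its selection method improves on.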