Improved Smoothing for N-gram Language Models Based on Ordinary Counts

15 years 4 months ago

Download www.aclweb.org

Kneser-Ney (1995) smoothing and its variants are generally recognized as having the best perplexity of any known method for estimating N-gram language models. Kneser-Ney smoothing, however, requires nonstandard N-gram counts for the lowerorder models used to smooth the highestorder model. For some applications, this makes Kneser-Ney smoothing inappropriate or inconvenient. In this paper, we introduce a new smoothing method based on ordinary counts that outperforms all of the previous ordinary-count methods we have tested, with the new method eliminating most of the gap between Kneser-Ney and those methods.

Robert C. Moore, Chris Quirk

Real-time Traffic

ACL 2009 | Computational Linguistics | N-gram Language Models | Ordinary Counts | Smoothing |

claim paper

Post Info
More Details (n/a)

Added	16 Feb 2011
Updated	16 Feb 2011
Type	Journal
Year	2009
Where	ACL
Authors	Robert C. Moore, Chris Quirk

Comments (0)

Sciweavers

Improved Smoothing for N-gram Language Models Based on Ordinary Counts

ACL 2009 | Computational Linguistics | N-gram Language Models | Ordinary Counts | Smoothing |

Explore & Download

Productivity Tools

Sciweavers