Traditional n-gram language models are widely used in state-of-the-art large vocabulary speech recognition systems. These simple models suffer from limitations such as overfitting...
The recent availability of large corpora for training N-gram language models has shown the utility of models of higher order than just trigrams. In this paper, we investigate methods...
Kneser-Ney (1995) smoothing and its variants are generally recognized as yielding the lowest perplexity of any known method for estimating N-gram language models. Kneser-Ney smoothing...
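For orientation, the interpolated bigram form of Kneser-Ney smoothing (the textbook formulation; the variants mentioned above differ mainly in how the discount is set) is

\[
P_{\mathrm{KN}}(w_i \mid w_{i-1}) = \frac{\max\bigl(c(w_{i-1} w_i) - D,\, 0\bigr)}{c(w_{i-1})} + \lambda(w_{i-1})\, P_{\mathrm{cont}}(w_i),
\qquad
P_{\mathrm{cont}}(w_i) = \frac{\lvert \{ w' : c(w' w_i) > 0 \} \rvert}{\lvert \{ (w', w'') : c(w' w'') > 0 \} \rvert},
\]

where \(c(\cdot)\) is a training-corpus count, \(D\) is a fixed discount, and \(\lambda(w_{i-1}) = \tfrac{D}{c(w_{i-1})}\,\lvert \{ w' : c(w_{i-1} w') > 0 \} \rvert\) makes the distribution sum to one. The continuation probability \(P_{\mathrm{cont}}\) rewards words that follow many distinct histories rather than words that are merely frequent.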
We present three novel methods of compactly storing very large n-gram language models. These methods use substantially less space than all known approaches and allow n-gram probabilities...
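The abstract does not name its three methods, so as a point of reference, here is a minimal sketch of one generic compact-storage idea such work competes against: bigrams packed into sorted 64-bit integer keys, looked up by binary search, with log-probabilities quantized to one byte. All names here are invented for illustration.

```python
import bisect
import math

class PackedBigramTable:
    """Hypothetical sketch: sorted packed keys plus 8-bit quantized
    log-probabilities. Not one of the paper's three methods."""

    def __init__(self, bigrams):
        # bigrams: {(w1_id, w2_id): prob}, with word ids below 2**32.
        keys = sorted(bigrams)
        # Pack each bigram into a single 64-bit integer key.
        self.keys = [(w1 << 32) | w2 for w1, w2 in keys]
        logs = [math.log(bigrams[k]) for k in keys]
        lo, hi = min(logs), max(logs)
        self.lo = lo
        self.step = (hi - lo) / 255 or 1.0
        # One byte per stored value instead of a 4- or 8-byte float.
        self.codes = bytes(round((lp - lo) / self.step) for lp in logs)

    def logprob(self, w1, w2):
        key = (w1 << 32) | w2
        i = bisect.bisect_left(self.keys, key)
        if i < len(self.keys) and self.keys[i] == key:
            return self.lo + self.codes[i] * self.step  # approximate
        return None  # unseen bigram: caller backs off to lower order
```

Quantizing values to 8 bits typically costs little perplexity while shrinking value storage severalfold; trie- and hash-based schemes compress the keys further still.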
This paper presents two different approaches to automatic captioning of geo-tagged images by summarizing multiple web documents that contain information related to an image’s location...
In this article, we propose the use of suffix arrays to efficiently implement n-gram language models with a practically unlimited order n. This approach, which is used with ...
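A suffix array makes this concrete: once the training corpus is indexed, the count of any n-gram, for any order n, is the width of one contiguous block of suffixes, found by binary search. A minimal sketch with invented names (Python 3.10+ for the `key=` argument to bisect):

```python
import bisect

def build_suffix_array(tokens):
    # Toy O(n^2 log n) construction; production systems use
    # linear-time suffix array algorithms.
    return sorted(range(len(tokens)), key=lambda i: tokens[i:])

def ngram_count(tokens, sa, ngram):
    # All suffixes beginning with `ngram` form one contiguous block in
    # the suffix array; two binary searches locate its boundaries, so a
    # single index serves every order n without separate n-gram tables.
    n = len(ngram)
    lo = bisect.bisect_left(sa, ngram, key=lambda i: tokens[i:i + n])
    hi = bisect.bisect_right(sa, ngram, key=lambda i: tokens[i:i + n])
    return hi - lo

corpus = "the cat sat on the mat the cat ran".split()
sa = build_suffix_array(corpus)
print(ngram_count(corpus, sa, ["the", "cat"]))  # -> 2
# Relative-frequency estimates follow directly, e.g.
# P(cat | the) = count(the cat) / count(the) = 2 / 3
```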
Neural probabilistic language models (NPLMs) have been shown to be competitive with, and occasionally superior to, the widely used n-gram language models. The main drawback of NPLMs...
Originally conceived as a "naive" baseline experiment using traditional n-gram language models as classifiers, the NCLEANER system has turned out to be a fast and lightweight...
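The underlying idea of an n-gram classifier is simple: train one language model per class and label each segment by whichever model assigns it the higher probability. A minimal sketch with invented names and add-one smoothing (NCLEANER's actual models and preprocessing differ):

```python
import math
from collections import Counter

class CharTrigramLM:
    """Add-one-smoothed character trigram model -- a minimal stand-in
    for the per-class models such a classifier trains."""

    def __init__(self, text, n=3):
        self.n = n
        self.grams = Counter(text[i:i + n] for i in range(len(text) - n + 1))
        self.ctxs = Counter(g[:-1] for g in self.grams.elements())
        self.v = len(set(text)) + 1  # crude vocabulary-size estimate

    def logprob(self, text):
        lp = 0.0
        for i in range(len(text) - self.n + 1):
            g = text[i:i + self.n]
            lp += math.log((self.grams[g] + 1) / (self.ctxs[g[:-1]] + self.v))
        return lp

def label(segment, clean_lm, dirty_lm):
    # Assign the segment to the class whose model scores it higher.
    if clean_lm.logprob(segment) >= dirty_lm.logprob(segment):
        return "text"
    return "boilerplate"
```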
The intersection of tree transducer-based translation models with n-gram language models results in huge dynamic programs for machine translation decoding. We propose a multipass,...
In this work we obtain robust category-based language models to be integrated into speech recognition systems. Deductive rules are used to select linguistic categories and to match...
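Category-based models typically rely on the class-based decomposition of Brown et al. (1992), in which the deductively selected linguistic categories would play the role of the classes:

\[
P(w_i \mid w_{i-1}) \;\approx\; P(w_i \mid c_i)\, P(c_i \mid c_{i-1}),
\]

where \(c_i\) is the category of \(w_i\). Context dependence is carried by the far smaller category vocabulary, which is what makes such models robust under sparse training data.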