n-gram models | Sciweavers

172

ICASSP
2011
IEEE

129views Signal Processing» more ICASSP 2011»

Variational approximation of long-span language models for lvcsr

14 years 10 months ago

Long-span language models that capture syntax and semantics are seldom used in the ﬁrst pass of large vocabulary continuous speech recognition systems due to the prohibitive sea...

Anoop Deoras, Tomas Mikolov, Stefan Kombrink, Mart...

claim paper

Read More »

210

click to vote

COLING
2010

234views Computational Linguistics» more COLING 2010»

Unsupervised Discriminative Language Model Training for Machine Translation using Simulated Confusion Sets

15 years 1 months ago

Download www.cs.jhu.edu

An unsupervised discriminative training procedure is proposed for estimating a language model (LM) for machine translation (MT). An English-to-English synchronous context-free gra...

Zhifei Li, Ziyuan Wang, Sanjeev Khudanpur, Jason E...

claim paper

Read More »

230

click to vote

ACL
2009

89views Computational Linguistics» more ACL 2009»

Variational Decoding for Statistical Machine Translation

15 years 4 months ago

Download www.cs.jhu.edu

Statistical models in machine translation exhibit spurious ambiguity. That is, the probability of an output string is split among many distinct derivations (e.g., trees or segment...

Zhifei Li, Jason Eisner, Sanjeev Khudanpur

claim paper

Read More »

182

click to vote

BMCBI
2005

153views more BMCBI 2005»

Learning Statistical Models for Annotating Proteins with Function Information using Biomedical Text

15 years 6 months ago

Download pages.cs.wisc.edu

Background: The BioCreative text mining evaluation investigated the application of text mining methods to the task of automatically extracting information from text in biomedical ...

Soumya Ray, Mark Craven

claim paper

Read More »

186

click to vote

ACL
2008

168views Computational Linguistics» more ACL 2008»

Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Translation

15 years 8 months ago

Download www.aclweb.org

In statistical language modeling, one technique to reduce the problematic effects of data sparsity is to partition the vocabulary into equivalence classes. In this paper we invest...

Jakob Uszkoreit, Thorsten Brants

claim paper

Read More »

187

click to vote

CIKM
2006
Springer

132views Information Technology» more CIKM 2006»

Text classification improved through multigram models

15 years 10 months ago

Download research.microsoft.com

Classification algorithms and document representation approaches are two key elements for a successful document classification system. In the past, much work has been conducted to...

Dou Shen, Jian-Tao Sun, Qiang Yang, Zheng Chen

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers