Within and across sentence boundary language model

In this paper, we propose two language modeling approaches, skip trigram and across sentence boundary, to capture long-range dependencies. The skip trigram model covers more predecessor words of the current word than the normal trigram while requiring the same memory. The across sentence boundary model uses the word distribution of the previous sentences to calculate the unigram probability, which is applied as the emission probability in the word and class model frameworks. Our experiments on the Penn Treebank [1] show that each proposed model, and also their combination, significantly outperforms the baseline for the word model, the class model, and their linear interpolation. The linear interpolation of the word and class models with the proposed skip trigram and across sentence boundary models achieves a perplexity of 118.4, while the best state-of-the-art language model has a perplexity of 137.2 on the same dataset.
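The abstract does not give the exact formulation, so the following is only a minimal sketch of the general ideas under stated assumptions. It assumes a skip trigram conditions on the words three and one positions back (skipping one word), uses add-alpha smoothing in place of whatever smoothing the paper uses, and estimates a unigram distribution from previous sentences without plugging it into a word or class model as an emission probability. All names (`CountModel`, `previous_sentence_unigram`, `interpolate`, `perplexity`) and the interpolation weights are hypothetical.

```python
import math
from collections import Counter


class CountModel:
    """Add-alpha smoothed n-gram model over arbitrary context offsets.

    offsets=(2, 1) is a normal trigram; offsets=(3, 1) is one possible
    skip trigram (an assumption, not necessarily the paper's choice).
    Both store two context words per entry.
    """

    def __init__(self, tokens, offsets, vocab_size, alpha=1.0):
        self.offsets = offsets
        self.vocab_size = vocab_size
        self.alpha = alpha
        self.pair = Counter()  # (context, word) counts
        self.ctx = Counter()   # context counts
        for i in range(max(offsets), len(tokens)):
            context = tuple(tokens[i - d] for d in offsets)
            self.pair[(context, tokens[i])] += 1
            self.ctx[context] += 1

    def prob(self, history, word):
        # history must hold at least max(offsets) preceding words
        context = tuple(history[-d] for d in self.offsets)
        num = self.pair[(context, word)] + self.alpha
        den = self.ctx[context] + self.alpha * self.vocab_size
        return num / den


def previous_sentence_unigram(previous_sentences, vocab, alpha=1.0):
    """Smoothed unigram distribution estimated from earlier sentences."""
    counts = Counter(w for sent in previous_sentences for w in sent)
    total = sum(counts.values())
    return {w: (counts[w] + alpha) / (total + alpha * len(vocab)) for w in vocab}


def interpolate(models, weights, history, word):
    """Linear interpolation; weights are assumed to sum to 1."""
    return sum(w * m.prob(history, word) for m, w in zip(models, weights))


def perplexity(models, weights, tokens):
    """Perplexity of the interpolated model on a token sequence."""
    start = max(max(m.offsets) for m in models)
    log_sum = 0.0
    for i in range(start, len(tokens)):
        log_sum += math.log(interpolate(models, weights, tokens[:i], tokens[i]))
    return math.exp(-log_sum / (len(tokens) - start))


tokens = "a b c d a b c d a b c d".split()
vocab = sorted(set(tokens))
trigram = CountModel(tokens, offsets=(2, 1), vocab_size=len(vocab))
skip_trigram = CountModel(tokens, offsets=(3, 1), vocab_size=len(vocab))
ppl = perplexity([trigram, skip_trigram], [0.5, 0.5], tokens)
cache = previous_sentence_unigram([["a", "b"], ["a", "c"]], vocab)
```

In this toy setup the skip trigram reaches one word further back than the trigram while keeping the same number of context words per entry, which is the intuition behind the memory claim in the abstract.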

Saeedeh Momtazi, Friedrich Faubel, Dietrich Klakow

Class Models | INTERSPEECH 2010 | Sentence Boundary | Sentence Boundary Models | Signal Processing |

» A Maximum Entropy Approach to Identifying Sentence Boundaries

» Enabling a Uniform Programming Model Across the Software/Hardware Boundary

» Dependency Parsing of Japanese Spoken Monologue Based on Clause Boundaries

» Modeling Workflow within Distributed Systems

» Liquid Metal: Object-Oriented Programming Across the Hardware/Software Boundary

» Consistent Product Line Configuration across File Type and Product Line Boundaries

» Toward communicating simple sentences using pictorial representations

» Parsing a Natural Language Using Mutual Information Statistics

Post Info

Added	18 May 2011
Updated	18 May 2011
Type	Journal
Year	2010
Where	INTERSPEECH
Authors	Saeedeh Momtazi, Friedrich Faubel, Dietrich Klakow
