Exploiting Multi-Word Units in History-Based Probabilistic Generation

14 years 8 months ago

Download acl.ldc.upenn.edu

We present a simple history-based model for sentence generation from LFG f-structures, which improves on the accuracy of previous models by breaking down PCFG independence assumptions so that more f-structure conditioning context is used in the prediction of grammar rule expansions. In addition, we present work on experiments with named entities and other multi-word units, showing a statistically signiﬁcant improvement of generation accuracy. Tested on section 23 of the Penn Wall Street Journal Treebank, the techniques described in this paper improve BLEU scores from 66.52 to 68.82, and coverage from 98.18% to 99.96%.

Deirdre Hogan, Conor Cafferkey, Aoife Cahill, Jose

Real-time Traffic

EMNLP 2007 | Grammar Rule Expansions | Natural Language Processing | PCFG Independence Assumptions | Simple History-based Model |

claim paper

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2007
Where	EMNLP
Authors	Deirdre Hogan, Conor Cafferkey, Aoife Cahill, Josef van Genabith

Comments (0)

Sciweavers

Exploiting Multi-Word Units in History-Based Probabilistic Generation

EMNLP 2007 | Grammar Rule Expansions | Natural Language Processing | PCFG Independence Assumptions | Simple History-based Model |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers