We argue that some of the computational complexity associated with estimating stochastic attribute-value grammars can be reduced by training upon an informative subset of the full training set. Results using the parsed Wall Street Journal corpus show that, in some circumstances, it is possible to obtain better estimation results using an informative sample than when training upon all the available material. Further experimentation demonstrates that with unlexicalised models, a Gaussian prior can reduce overfitting. However, when models are lexicalised and contain overlapping features, overfitting does not seem to be a problem, and a Gaussian prior makes minimal difference to performance. Our approach is applicable in situations where the training set contains an infeasibly large number of parses, or where recovering those parses from a packed representation is itself computationally expensive.
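For concreteness, the Gaussian prior mentioned above is commonly realised as a quadratic penalty on the weights of a log-linear parse model. The following is a minimal sketch of such a penalised objective, assuming feature weights \lambda_i with prior variances \sigma_i^2 and feature functions f_i over parses \omega; the notation is illustrative and not taken from the paper:

\[
  \mathcal{L}(\lambda) \;=\; \sum_{j} \log p_{\lambda}(\omega_j) \;-\; \sum_{i} \frac{\lambda_i^{2}}{2\sigma_i^{2}},
  \qquad
  p_{\lambda}(\omega) \;=\; \frac{\exp\bigl(\sum_i \lambda_i f_i(\omega)\bigr)}{\sum_{\omega'} \exp\bigl(\sum_i \lambda_i f_i(\omega')\bigr)}.
\]

Maximising this objective shrinks the weights towards zero, which is the mechanism by which the prior limits overfitting.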