Accurate Unlexicalized Parsing

14 years 2 months ago

Download nlp.stanford.edu

We demonstrate that an unlexicalized PCFG can parse much more accurately than previously shown, by making use of simple, linguistically motivated state splits, which break down false independence assumptions latent in a vanilla treebank grammar. Indeed, its performance of 86.36% (LP/LR F1) is better than that of early lexicalized PCFG models, and surprisingly close to the current state-of-theart. This result has potential uses beyond establishing a strong lower bound on the maximum possible accuracy of unlexicalized models: an unlexicalized PCFG is much more compact, easier to replicate, and easier to interpret than more complex lexical models, and the parsing algorithms are simpler, more widely understood, of lower asymptotic complexity, and easier to optimize. In the early 1990s, as probabilistic methods swept NLP, parsing work revived the investigation of probabilistic context-free grammars (PCFGs) (Booth and Thomson, 1973; Baker, 1979). However, early results on the utility of PCF...

Dan Klein, Christopher D. Manning

Real-time Traffic

ACL 2003 | ACL 2007 | Early Lexicalized Pcfg | Lexicalized | Unlexicalized Pcfg |

claim paper

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2003
Where	ACL
Authors	Dan Klein, Christopher D. Manning

Comments (0)

Sciweavers

Accurate Unlexicalized Parsing

ACL 2003 | ACL 2007 | Early Lexicalized Pcfg | Lexicalized | Unlexicalized Pcfg |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers