The Creation of a Large-Scale LFG-Based Gold Parsebank

15 years 8 months ago

Download www.lrec-conf.org

Systems for syntactically parsing sentences have long been recognized as a priority in Natural Language Processing. Statistics-based systems require large amounts of high quality syntactically parsed data. Using the XLE toolkit developed at PARC and the LFG Parsebanker interface developed at Bergen, the Parsebank Project at Powerset has generated a rapidly increasing volume of syntactically parsed data. By using these tools, we are able to leverage the LFG framework to provide richer analyses via both constituent (c-) and functional (f-) structures. Additionally, the Parsebanking Project uses source data from Wikipedia rather than source data limited to a specific genre, such as the Wall Street Journal. This paper outlines the process we used in creating a large-scale LFG-Based Parsebank to address many of the shortcomings of previously-created parse banks such as the Penn Treebank. While the Parsebank corpus is still in progress, preliminary results using the data in a variety of con...

Alexis Baird, Christopher R. Walker

Real-time Traffic

Education | LFG Parsebanker Interface | LREC 2010 | Source Data | Syntactically |

claim paper

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	LREC
Authors	Alexis Baird, Christopher R. Walker

Comments (0)

Sciweavers

The Creation of a Large-Scale LFG-Based Gold Parsebank

Education | LFG Parsebanker Interface | LREC 2010 | Source Data | Syntactically |

Explore & Download

Productivity Tools

Sciweavers