Error Mining for Wide-Coverage Grammar Engineering

Parsing systems that rely on hand-coded linguistic descriptions can perform adequately only insofar as these descriptions are correct and complete. The paper describes an error mining technique for discovering problems in the hand-coded linguistic descriptions used for parsing, such as grammars and lexicons. By analysing parse results for very large unannotated corpora, the technique discovers missing, incorrect or incomplete linguistic descriptions. The technique uses the frequency of n-grams of words for arbitrary values of n. It is shown how a new combination of suffix arrays and perfect hash finite automata allows an efficient implementation.
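To make the idea in the abstract concrete, here is a minimal sketch of n-gram based error mining. It rests on assumptions that the abstract does not state: each sentence comes with a flag saying whether the parser succeeded, and each n-gram is scored by the fraction of its occurrences that fall in successfully parsed sentences. The names `ngrams`, `mine_errors`, `max_n` and `min_count` are hypothetical, and plain in-memory counters stand in for the suffix arrays and perfect hash finite automata that the paper uses for efficiency.

```python
from collections import Counter


def ngrams(tokens, max_n):
    """Yield every word n-gram of length 1..max_n as a tuple."""
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            yield tuple(tokens[i:i + n])


def mine_errors(sentences, max_n=3, min_count=2):
    """Rank n-grams by how rarely they occur in parsed sentences.

    sentences: iterable of (text, parsed_ok) pairs (assumed input format).
    Returns a list of (ngram, ratio, total) sorted so that the most
    suspicious n-grams (lowest ratio, then highest frequency) come first.
    """
    total = Counter()  # occurrences in all sentences
    ok = Counter()     # occurrences in successfully parsed sentences
    for text, parsed_ok in sentences:
        for gram in ngrams(text.split(), max_n):
            total[gram] += 1
            if parsed_ok:
                ok[gram] += 1
    scored = [
        (gram, ok[gram] / count, count)
        for gram, count in total.items()
        if count >= min_count  # ignore n-grams too rare to judge
    ]
    scored.sort(key=lambda item: (item[1], -item[2], item[0]))
    return scored


# Toy corpus: every sentence containing "snores" fails to parse.
corpus = [
    ("the cat sleeps", True),
    ("the dog sleeps", True),
    ("the cat snores loudly", False),
    ("a dog snores", False),
    ("a cat sleeps", True),
]
suspects = mine_errors(corpus, max_n=2, min_count=2)
for gram, ratio, count in suspects[:3]:
    print(" ".join(gram), round(ratio, 2), count)
```

In this toy corpus the word "snores" ranks first, which would point a grammar engineer towards a missing or faulty lexical entry. Counting all n-grams in dictionaries like this does not scale to very large corpora or unbounded n, which is the gap the suffix array approach is meant to close.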

Gertjan van Noord

ACL 2004 | ACL 2007 | Error Mining Technique | Hand-coded Linguistic Descriptions | Incomplete Linguistic Descriptions |

Post Info

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2004
Where	ACL
Authors	Gertjan van Noord
