An Endogeneous Corpus-Based Method for Structural Noun Phrase Disambiguation

In this paper, we describe a method for structural noun phrase disambiguation which relies mainly on the examination of the text corpus under analysis and does not need to integrate any domain-dependent lexico- or syntactico-semantic information. This method is implemented in the Terminology Extraction Software LEXTER. We first explain why the integration of LEXTER into the LEXTER-K project, which aims at building a tool for knowledge extraction from large technical text corpora, requires improving the quality of the terminology extracted by LEXTER. Then we briefly describe the way LEXTER works and show what kind of disambiguation it has to perform when parsing "maximal-length" noun phrases. We introduce a method of disambiguation which relies on a very simple idea: whenever LEXTER has to choose among several competing noun sub-groups in order to disambiguate a maximal-length noun phrase, it checks whether each of these sub-groups occurs anywhere else in the corpus in a non-a...
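
The corpus lookup described in the abstract can be illustrated with a minimal Python sketch. It is not LEXTER's implementation. The abstract is cut off before it states which kind of occurrence counts, so the sketch assumes that a competing sub-group is supported when it appears elsewhere in the corpus as a phrase in its own right. The function name `choose_subgroup`, the tuple representation of phrases, and the English toy data are all hypothetical.

```python
from collections import Counter


def choose_subgroup(candidates, corpus_phrases):
    """Return the candidate sub-groups attested in the corpus, most frequent first.

    candidates     -- competing sub-groups of one maximal-length noun phrase,
                      each given as a tuple of words (assumed representation)
    corpus_phrases -- phrases observed elsewhere in the same corpus; what
                      qualifies as such an occurrence is an assumption here
    """
    counts = Counter(corpus_phrases)
    attested = [c for c in candidates if counts[c] > 0]
    # sorted() is stable, so equally frequent candidates keep their input order.
    return sorted(attested, key=lambda c: -counts[c])


# Hypothetical toy data: phrases seen elsewhere in the corpus.
corpus_phrases = [
    ("water", "pump"),
    ("pump", "failure"),
    ("water", "pump"),
    ("control", "valve"),
]

# Two competing sub-groups for the made-up sequence "water pump housing".
candidates = [("water", "pump"), ("pump", "housing")]

result = choose_subgroup(candidates, corpus_phrases)
print(result)
```

The sketch uses only evidence from the corpus being analysed, with no external lexicon, which is the sense in which the method is described as endogenous. When no candidate is attested, the function returns an empty list and the ambiguity is left unresolved.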

Didier Bourigault

Disambiguation | EACL 1993 | Natural Language Processing | Noun Phrase Disambiguation | Noun Phrases |

Added	02 Nov 2010
Updated	02 Nov 2010
Type	Conference
Year	1993
Where	EACL
Authors	Didier Bourigault
