Sciweavers

AAAI
1994

Corpus-Driven Knowledge Acquisition for Discourse Analysis

14 years 1 months ago
Corpus-Driven Knowledge Acquisition for Discourse Analysis
The availability of large on-line text corpora provides a natural and promising bridge between the worlds of natural language processing (NLP) and machine learning (ML). In recent years, the NLP community has been aggressively investigating statistical techniques to drive part-of-speech taggers, but application-specific text corpora can be used to drive knowledge acquisition at much higher levels as well. In this paper we will show how ML techniques can be used to support knowledge acquisition for information extraction systems. It is often very difficult to specify an explicit domain model for many information extraction applications, and it is always labor intensive to implement hand-coded heuristics for each new domain. We have discovered that it is nevertheless possible to use ML algorithms in order to capture knowledge that is only implicitly present in a representative text corpus. Our work addresses issues traditionally associated with discourse analysis and intersentential inf...
Stephen Soderland, Wendy G. Lehnert
Added 02 Nov 2010
Updated 02 Nov 2010
Type Conference
Year 1994
Where AAAI
Authors Stephen Soderland, Wendy G. Lehnert
Comments (0)