Corpus-Driven Knowledge Acquisition for Discourse Analysis

15 years 8 months ago

Download www.aaai.org

The availability of large on-line text corpora provides a natural and promising bridge between the worlds of natural language processing (NLP) and machine learning (ML). In recent years, the NLP community has been aggressively investigating statistical techniques to drive part-of-speech taggers, but application-specific text corpora can be used to drive knowledge acquisition at much higher levels as well. In this paper we will show how ML techniques can be used to support knowledge acquisition for information extraction systems. It is often very difficult to specify an explicit domain model for many information extraction applications, and it is always labor intensive to implement hand-coded heuristics for each new domain. We have discovered that it is nevertheless possible to use ML algorithms in order to capture knowledge that is only implicitly present in a representative text corpus. Our work addresses issues traditionally associated with discourse analysis and intersentential inf...

Stephen Soderland, Wendy G. Lehnert

Real-time Traffic

AAAI 1994 | Information Extraction | Information Extraction System | Intelligent Agents | Ml Algorithms |

claim paper

Post Info
More Details (n/a)

Added	02 Nov 2010
Updated	02 Nov 2010
Type	Conference
Year	1994
Where	AAAI
Authors	Stephen Soderland, Wendy G. Lehnert

Comments (0)

Sciweavers

Corpus-Driven Knowledge Acquisition for Discourse Analysis

AAAI 1994 | Information Extraction | Information Extraction System | Intelligent Agents | Ml Algorithms |

Explore & Download

Productivity Tools

Sciweavers