A Corpus for Cross-Document Co-reference



This paper describes a newly created text corpus of news articles that has been annotated for cross-document co-reference. Being able to robustly resolve references to entities across document boundaries will provide a useful capability for a variety of tasks, ranging from practical information retrieval applications to challenging research in information extraction and natural language understanding. This annotated corpus is intended to encourage the development of systems that can more accurately address this problem. A manual annotation tool was developed that allowed the complete corpus to be searched for likely co-referring entity mentions. This corpus of 257K words links mentions of co-referent people, locations and organizations (subject to some additional constraints). Each of the documents had already been annotated for within-document coreference by the LDC as part of the ACE series of evaluations. The annotation process was bootstrapped with a string-matching-based linking ...

David Day, Janet Hitzeman, Michael L. Wick, Keith Crouch, Massimo Poesio


Annotated Corpus | Co-referring Entity Mentions | Education | LREC 2008 | Text Corpus |


» Person Cross Document Coreference with Name Perplexity Estimates

» Profile Based Cross-Document Coreference Using Kernelized Fuzzy Relational Clustering

» Collective Cross-Document Relation Extraction Without Labelled Data

» An Empirical Investigation of the Relation Between Discourse Structure and Co-Reference

» Cross-Document Coreference on a Large Scale Corpus

Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	LREC
Authors	David Day, Janet Hitzeman, Michael L. Wick, Keith Crouch, Massimo Poesio
