Exploiting Multiply Annotated Corpora in Biomedical Information Extraction Tasks

This paper discusses the problem of utilising multiply annotated data in training biomedical information extraction systems. Two corpora, annotated with entities and relations, and containing a number of multiply annotated documents, are used to train named entity recognition and relation extraction systems. Several methods of automatically combining the multiple annotations to produce a single annotation are compared, but none produces better results than simply picking one of the annotated versions at random. It is also shown that adding extra singly annotated documents produces faster performance gains than adding extra multiply annotated documents.
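The abstract does not say which combination methods were compared, so the following is only a minimal sketch of what "combining multiple annotations" versus "picking one at random" could look like. It assumes each annotated version of a document is a set of `(start, end, label)` entity spans; the function names and the three combination strategies (union, intersection, majority vote) are illustrative assumptions, not the authors' methods.

```python
import random
from collections import Counter
from typing import List, Set, Tuple

# Assumed representation: one entity span is (start offset, end offset, label).
Span = Tuple[int, int, str]


def pick_random(versions: List[Set[Span]], rng: random.Random) -> Set[Span]:
    """Baseline: keep a single annotator's version, chosen at random."""
    return set(rng.choice(versions))


def combine_union(versions: List[Set[Span]]) -> Set[Span]:
    """Keep every span marked by at least one annotator."""
    return set().union(*versions)


def combine_intersection(versions: List[Set[Span]]) -> Set[Span]:
    """Keep only spans that every annotator marked."""
    return set(versions[0]).intersection(*versions[1:])


def combine_majority(versions: List[Set[Span]]) -> Set[Span]:
    """Keep spans marked by more than half of the annotators."""
    counts = Counter(span for version in versions for span in version)
    return {span for span, n in counts.items() if 2 * n > len(versions)}


# Hypothetical example: three annotators, one document.
versions = [
    {(0, 4, "Protein"), (10, 15, "Gene")},
    {(0, 4, "Protein")},
    {(0, 4, "Protein"), (10, 15, "Gene"), (20, 25, "Protein")},
]
single = pick_random(versions, random.Random(0))
merged = combine_majority(versions)
```

Whichever strategy is used, the result is one annotation per document that can be fed to a standard training pipeline. The finding reported in the abstract is that, in the authors' experiments, none of the combination methods they compared did better than the random-pick baseline.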

Barry Haddow, Beatrice Alex

Annotated | Education | LREC 2008 | Multiply Annotated Data | Multiply Annotated Documents |

» Comparative analysis of five protein-protein interaction corpora

» LINNAEUS: A species name identification system for biomedical literature

» Learning Sentence-internal Temporal Relations

» Modular Ontology Design Using Canonical Building Blocks in the Biochemistry Domain

Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	LREC
Authors	Barry Haddow, Beatrice Alex

Sciweavers
