BioInfer: a corpus for information extraction in the biomedical domain


Download www.biomedcentral.com

Background: Lately, there has been great interest in the application of information extraction methods to the biomedical domain, in particular to the extraction of relationships of genes, proteins, and RNA from scientific publications. The development and evaluation of such methods requires annotated domain corpora. Results: We present BioInfer (Bio Information Extraction Resource), a new public resource providing an annotated corpus of biomedical English. We describe an annotation scheme capturing named entities and their relationships along with a dependency analysis of sentence syntax. We further present ontologies defining the types of entities and relationships annotated in the corpus. Currently, the corpus contains 1100 sentences from abstracts of biomedical research articles annotated for relationships, named entities, as well as syntactic dependencies. Supporting software is provided with the corpus. The corpus is unique in the domain in combining these annotation types for a singl...



Annotated | Biomedical | BMCBI 2007 | Information Extraction |


» Dependency-Driven Feature-based Learning for Extracting Protein-Protein Interactions from Bio...

» High-performance information extraction with AliBaba

» Improving biomedical document retrieval using domain knowledge

» Semantic Annotation of Biomedical Literature Using Google

» Rerendering Semantic Ontologies: Automatic Extensions to UMLS through Corpus Analytics

» Use of OWL 2 to Facilitate a Biomedical Knowledge Base Extracted from the GENIA Corpus

» Improving Biomedical Document Retrieval by Mining Domain Knowledge

» Event-based Information Extraction for the biomedical domain: the Caderige project

Post Info

Added	08 Dec 2010
Updated	08 Dec 2010
Type	Journal
Year	2007
Where	BMCBI
Authors	Sampo Pyysalo, Filip Ginter, Juho Heimonen, Jari Björne, Jorma Boberg, Jouni Järvinen, Tapio Salakoski
