A Semantically Annotated Swedish Medical Corpus

With the information overload in the life sciences, there is an increasing need for annotated corpora, particularly corpora annotated with biological and biomedical entities; such corpora are the driving force behind data-driven language processing applications and the empirical approach to language study. Inspired by the GENIA Corpus, one of the very few such corpora and one used extensively in the biomedical field, and in order to fulfil the needs of our research, we have collected a Swedish medical corpus, the MEDLEX Corpus. MEDLEX is a large, structurally and linguistically annotated document collection consisting of a variety of text documents from various medical subfields. It does not focus on a particular medical genre, owing to the lack of large Swedish resources within any particular medical subdomain. Out of this collection we selected 300 documents, which were manually examined by two human experts who inspected, corrected and/or accordingly modified the automatically provided a...

Dimitrios Kokkinakis

Education | LREC 2008 | Medical | Particular Medical Genre | Swedish Medical Corpus |

Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	LREC
Authors	Dimitrios Kokkinakis
