Sciweavers

LREC
2008

Learning-based Detection of Scientific Terms in Patient Information

In this paper, we investigate the use of a machine-learning based approach to the specific problem of scientific term detection in patient information. Lacking lexical databases which differentiate between the scientific and popular nature of medical terms, we used local context, morphosyntactic, morphological and statistical information to design a learner which accurately detects scientific medical terms. This study is the first step towards the automatic replacement of a scientific term by its popular counterpart, which should have a beneficial effect on readability. We show an F-score of 84% for the prediction of scientific terms in an English and Dutch EPAR corpus. Since recasting the term extraction problem as a classification problem leads to a large skewedness of the resulting data set, we rebalanced the data set through the application of some simple TF-IDF-based and Log-likelihood-based filters. We show that filtering indeed has a beneficial effect on the learner's perf...
Added: 29 Oct 2010
Updated: 29 Oct 2010
Type: Conference
Year: 2008
Where: LREC
Authors: Véronique Hoste, Els Lefever, Klaar Vanopstal, Isabelle Delaere
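
The abstract mentions rebalancing the skewed data set with simple TF-IDF-based filters before classification. The listing does not give the paper's actual formulas, thresholds, or tokenization, so the following is only a minimal sketch of one plausible filter of that kind. The function names, the threshold value, and the toy corpus are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def tfidf_scores(documents):
    """Return the highest TF-IDF score each token reaches in any document.

    Assumption: plain relative term frequency and log(N / df) as IDF.
    The paper's exact weighting scheme is not stated in the abstract.
    """
    n_docs = len(documents)
    # Document frequency: number of documents that contain each token.
    doc_freq = Counter()
    for tokens in documents:
        doc_freq.update(set(tokens))

    best = {}
    for tokens in documents:
        if not tokens:
            continue  # skip empty documents to avoid dividing by zero
        counts = Counter(tokens)
        for token, count in counts.items():
            tf = count / len(tokens)
            idf = math.log(n_docs / doc_freq[token])
            best[token] = max(best.get(token, 0.0), tf * idf)
    return best


def filter_candidates(documents, threshold):
    """Keep only tokens whose best TF-IDF score exceeds the threshold.

    Discarding low-scoring tokens removes many obvious non-terms, which
    reduces the share of negative instances passed on to a classifier.
    """
    scores = tfidf_scores(documents)
    return {token for token, score in scores.items() if score > threshold}


# Toy corpus, invented for illustration (not data from the paper).
docs = [
    ["the", "patient", "may", "develop", "thrombocytopenia"],
    ["the", "patient", "should", "take", "the", "tablet"],
    ["the", "tablet", "may", "cause", "nausea"],
]
# Hypothetical threshold; a real value would be tuned on held-out data.
candidates = filter_candidates(docs, threshold=0.1)
```

In this toy run, tokens that occur in every document (such as "the") receive a score of zero and are dropped, while tokens confined to a single document survive. A log-likelihood-based filter, the other kind named in the abstract, could be slotted in the same way by swapping the scoring function.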