Active learning for biomedical citation screening

Active learning (AL) is an increasingly popular strategy for mitigating the amount of labeled data required to train classifiers, thereby reducing annotator effort. We describe a real-world, deployed application of AL to the problem of biomedical citation screening for systematic reviews at the Tufts Evidence-based Practice Center. We propose a novel active learning strategy that exploits a priori domain knowledge provided by the expert (specifically, labeled features) and extend this model via a Linear Programming algorithm for situations where the expert can provide ranked labeled features. Our methods outperform existing AL strategies on three real-world systematic review datasets. We argue that evaluation must be specific to the scenario under consideration. To this end, we propose a new evaluation framework for finite-pool scenarios, wherein the primary aim is to label a fixed set of examples rather than to simply induce a good predictive model. We use a method from medical...

Active Learning | Citation Screening | Data Mining | KDD 2010 | Systematic Review |

Post Info

Added	12 Oct 2010
Updated	12 Oct 2010
Type	Conference
Year	2010
Where	KDD
Authors	Byron C. Wallace, Kevin Small, Carla E. Brodley, Thomas A. Trikalinos
