Sciweavers

BMCBI
2008

Accelerating the annotation of sparse named entities by dynamic sentence selection

13 years 11 months ago
Accelerating the annotation of sparse named entities by dynamic sentence selection
This paper presents an active learning-like framework for reducing the human effort for making named entity annotations in a corpus. In this framework, the annotation work is performed as an iterative and interactive process between the human annotator and a probabilistic named entity tagger. At each iteration, sentences that are most likely to contain named entities of the target category are selected by the probabilistic tagger and presented to the annotator. This iterative annotation process is repeated until the estimated coverage reaches the desired level. Unlike active learning approaches, our framework produces a named entity corpus that is free from the sampling bias introduced by the active strategy. We evaluated our framework by simulating the annotation process using two named entity corpora and show that our approach could drastically reduce the number of sentences to be annotated when applied to sparse named entities.
Yoshimasa Tsuruoka, Jun-ichi Tsujii, Sophia Anania
Added 09 Dec 2010
Updated 09 Dec 2010
Type Journal
Year 2008
Where BMCBI
Authors Yoshimasa Tsuruoka, Jun-ichi Tsujii, Sophia Ananiadou
Comments (0)