A graph-search framework for associating gene identifiers with documents

14 years 24 days ago

Download www.cs.cmu.edu

Background: One step in the model organism database curation process is to find, for each article, the identifier of every gene discussed in the article. We consider a relaxation of this problem suitable for semi-automated systems, in which each article is associated with a ranked list of possible gene identifiers, and experimentally compare methods for solving this geneId ranking problem. In addition to baseline approaches based on combining named entity recognition (NER) systems with a "soft dictionary" of gene synonyms, we evaluate a graph-based method which combines the outputs of multiple NER systems, as well as other sources of information, and a learning method for reranking the output of the graph-based method. Results: We show that named entity recognition (NER) systems with similar F-measure performance can have significantly different performance when used with a soft dictionary for geneId-ranking. The graph-based approach can outperform any of its component NER s...

William W. Cohen, Einat Minkov

Real-time Traffic

BMCBI 2006 | Entity Recognition | NER Systems | Systems |

claim paper

Post Info
More Details (n/a)

Added	10 Dec 2010
Updated	10 Dec 2010
Type	Journal
Year	2006
Where	BMCBI
Authors	William W. Cohen, Einat Minkov

Comments (0)

Sciweavers

A graph-search framework for associating gene identifiers with documents

BMCBI 2006 | Entity Recognition | NER Systems | Systems |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers