Moara: a Java library for extracting and normalizing gene and protein mentions

15 years 8 months ago

Download www.biomedcentral.com

Background: Gene/protein recognition and normalization are important preliminary steps for many biological text mining tasks, such as information retrieval, protein-protein interactions, and extraction of semantic information, among others. Despite dedication to these problems and effective solutions being reported, easily integrated tools to perform these tasks are not readily available. Results: This study proposes a versatile and trainable Java library that implements gene/protein tagger and normalization steps based on machine learning approaches. The system has been trained for several model organisms and corpora but can be expanded to support new organisms and documents. Conclusions: Moara is a flexible, trainable and open-source system that is not specifically orientated to any organism and therefore does not requires specific tuning in the algorithms or dictionaries utilized. Moara can be used as a stand-alone application or can be incorporated in the workflow of a more genera...

Mariana L. Neves, José María Carazo,

Real-time Traffic

Biological Text Mining | BMCBI 2010 | Normalization Tasks | Text Mining Tasks |

claim paper

Post Info
More Details (n/a)

Added	08 Dec 2010
Updated	08 Dec 2010
Type	Journal
Year	2010
Where	BMCBI
Authors	Mariana L. Neves, José María Carazo, Alberto D. Pascual-Montano

Comments (0)

Sciweavers

Moara: a Java library for extracting and normalizing gene and protein mentions

Biological Text Mining | BMCBI 2010 | Normalization Tasks | Text Mining Tasks |

Explore & Download

Productivity Tools

Sciweavers