Identification and correction of abnormal, incomplete and mispredicted proteins in public databases

15 years 8 months ago

Download www.biomedcentral.com

Background: Despite significant improvements in computational annotation of genomes, sequences of abnormal, incomplete or incorrectly predicted genes and proteins remain abundant in public databases. Since the majority of incomplete, abnormal or mispredicted entries are not annotated as such, these errors seriously affect the reliability of these databases. Here we describe the MisPred approach that may provide an efficient means for the quality control of databases. The current version of the MisPred approach uses five distinct routines for identifying abnormal, incomplete or mispredicted entries based on the principle that a sequence is likely to be incorrect if some of its features conflict with our current knowledge about protein-coding genes and proteins: (i) conflict between the predicted subcellular localization of proteins and the absence of the corresponding sequence signals; (ii) presence of extracellular and cytoplasmic domains and the absence of transmembrane segments; (ii...

Alinda Nagy, Hédi Hegyi, Krisztina Farkas,

Real-time Traffic

Abnormal | BMCBI 2008 | MisPred Approach | Proteins |

claim paper

Post Info
More Details (n/a)

Added	09 Dec 2010
Updated	09 Dec 2010
Type	Journal
Year	2008
Where	BMCBI
Authors	Alinda Nagy, Hédi Hegyi, Krisztina Farkas, Hedvig Tordai, Evelin Kozma, László Bányai, László Patthy

Comments (0)

Sciweavers

Identification and correction of abnormal, incomplete and mispredicted proteins in public databases

Abnormal | BMCBI 2008 | MisPred Approach | Proteins |

Explore & Download

Productivity Tools

Sciweavers