Sciweavers

BMCBI
2008

Identification and correction of abnormal, incomplete and mispredicted proteins in public databases

13 years 11 months ago
Identification and correction of abnormal, incomplete and mispredicted proteins in public databases
Background: Despite significant improvements in computational annotation of genomes, sequences of abnormal, incomplete or incorrectly predicted genes and proteins remain abundant in public databases. Since the majority of incomplete, abnormal or mispredicted entries are not annotated as such, these errors seriously affect the reliability of these databases. Here we describe the MisPred approach that may provide an efficient means for the quality control of databases. The current version of the MisPred approach uses five distinct routines for identifying abnormal, incomplete or mispredicted entries based on the principle that a sequence is likely to be incorrect if some of its features conflict with our current knowledge about protein-coding genes and proteins: (i) conflict between the predicted subcellular localization of proteins and the absence of the corresponding sequence signals; (ii) presence of extracellular and cytoplasmic domains and the absence of transmembrane segments; (ii...
Alinda Nagy, Hédi Hegyi, Krisztina Farkas,
Added 09 Dec 2010
Updated 09 Dec 2010
Type Journal
Year 2008
Where BMCBI
Authors Alinda Nagy, Hédi Hegyi, Krisztina Farkas, Hedvig Tordai, Evelin Kozma, László Bányai, László Patthy
Comments (0)