Sciweavers

BMCBI
2004

Enhanced protein domain discovery using taxonomy

13 years 11 months ago
Enhanced protein domain discovery using taxonomy
Background: It is well known that different species have different protein domain repertoires, and indeed that some protein domains are kingdom specific. This information has not yet been incorporated into statistical methods for finding domains in sequences of amino acids. Results: We show that by incorporating our understanding of the taxonomic distribution of specific protein domains, we can enhance domain recognition in protein sequences. We identify 4447 new instances of Pfam domains in the SP-TREMBL database using this technique, equivalent to the coverage increase given by the last 8.3% of Pfam families and to a 0.7% increase in the number of domain predictions. We use PSI-BLAST to cross-validate our new predictions. We also benchmark our approach using a SCOP test set of proteins of known structure, and demonstrate improvements relative to standard Hidden Markov model techniques. Conclusions: Explicitly including knowledge about the taxonomic distribution of protein domains ca...
Lachlan James M. Coin, Alex Bateman, Richard Durbi
Added 16 Dec 2010
Updated 16 Dec 2010
Type Journal
Year 2004
Where BMCBI
Authors Lachlan James M. Coin, Alex Bateman, Richard Durbin
Comments (0)