EVEREST: automatic identification and classification of protein domains in all protein sequences

15 years 6 months ago

Download www.biomedcentral.com

Background: Proteins are comprised of one or several building blocks, known as domains. Such domains can be classified into families according to their evolutionary origin. Whereas sequencing technologies have advanced immensely in recent years, there are no matching computational methodologies for large-scale determination of protein domains and their boundaries. We provide and rigorously evaluate a novel set of domain families that is automatically generated from sequence data. Our domain family identification process, called EVEREST (EVolutionary Ensembles of REcurrent SegmenTs), begins by constructing a library of protein segments that emerge in an all vs. all pairwise sequence comparison. It then proceeds to cluster these segments into putative domain families. The selection of the best putative families is done using machine learning techniques. A statistical model is then created for each of the chosen families. This procedure is then iterated: the aforementioned statistical mo...

Elon Portugaly, Amir Harel, Nathan Linial, Michal

Real-time Traffic

BMCBI 2006 | Domain Families | EVEREST | Protein |

claim paper

» ProtoMap automatic classification of protein sequences and hierarchy of protein families

» Touring protein fold space with DaliFSSP

» JACOP A simple and robust method for the automated classification of protein sequences wit...

» GeneRAGE a robust algorithm for sequence clustering and domain detection

» DoBo Protein domain boundary prediction by integrating evolutionary signals and machine le...

» Classification of protein quaternary structure by functional domain composition

» A functional hierarchical organization of the protein sequence space

» Improved general regression network for protein domain boundary prediction

Post Info
More Details (n/a)

Added	10 Dec 2010
Updated	10 Dec 2010
Type	Journal
Year	2006
Where	BMCBI
Authors	Elon Portugaly, Amir Harel, Nathan Linial, Michal Linial

Comments (0)

Sciweavers

EVEREST: automatic identification and classification of protein domains in all protein sequences

BMCBI 2006 | Domain Families | EVEREST | Protein |

Explore & Download

Productivity Tools

Sciweavers